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Executive  Summary 


LMI  assessed  the  use  of  spectrophotometers  as  an  aid  to  military  shade  evaluators 
in  the  evaluation  of  military  fabrics.  Spectrophotometers  detect  and  quantify  color. 
Military  fabric  suppliers  currently  use  spectrophotometers  as  part  of  their  quality 
control  process;  however,  when  the  suppliers  ship  their  fabric  to  clothing  manufac¬ 
turers,  the  procedure  for  determining  whether  the  fabric  meets  the  military  shade 
standard  depends  solely  on  visual  evaluation.  Visual  evaluation  by  trained  shade 
evaluators  is  widely  considered  superior  to  instrumentation  because  the  human  eye 
is  more  sensitive  to  shade  variation.  This  is  especially  important  in  the  evaluation  of 
multicolored  camouflage  patterns. 

The  feedback  from  visual  evaluations  that  is  provided  to  fabric  suppliers  is  often 
general  and  vague.  Without  more  useful  feedback,  the  fabric  suppliers  have  diffi¬ 
culty  making  the  proper  adjustments  to  their  printing  and  dying  processes  to  meet 
the  given  shade  standard  for  a  specific  fabric  in  future  production.  Spectrophoto¬ 
meter  measurements  provide  more  quantitative  feedback  for  shade  evaluations, 
but  concerns  regarding  the  consistency  of  spectrophotometer  measurements  have 
raised  questions  about  the  use  of  these  measurements  in  government  shade  evalu¬ 
ations  and  the  feedback  they  provide  fabric  suppliers. 

This  study  addressed  two  main  questions: 

♦  Do  spectrophotometer  measurements  collected  at  different  locations  pro¬ 
duce  consistent  color  measurements? 

♦  Given  that  the  instruments  do  not  have  the  same  sensitivity  to  shade  as  the 
human  eye,  to  what  extent  do  human  visual  evaluations  agree  with  spec¬ 
trophotometer  evaluation? 

The  study  was  conducted  in  three  phases.  Phase  1  was  an  exploratory  study  to  eva¬ 
luate  the  feasibility  of  our  data  collection  strategy  and  validate  our  methodology.  In 
Phase  2,  spectrophotometer  measurements  of  color  tiles  and  fabrics  were  collected 
from  19  test  sites  to  see  if  the  measurements  were  consistent  with  one  another,  both 
within  and  across  test  sites.  With  only  a  few  exceptions,  these  measurements  were 
broadly  consistent. 
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Phase  3  involved  the  following: 


♦  Recruiting  a  panel  of  10  people  to  evaluate  whether  100  fabric  samples 
met  designated  color  shade  pass/fail  criteria. 

♦  Using  a  spectrophotometer  to  generate  pass/fail  decisions  on  these  same 
fabrics. 

♦  Obtaining  an  “official  determination”  from  experts  on  color  shades  to  as¬ 
certain  whether  the  samples  should  have  received  pass/fail  designations. 

♦  Comparing  the  pass/fail  designations  of  the  10  people  with  the  pass/fail 
decisions  generated  by  the  spectrophotometer  in  terms  of  how  often  these 
decisions  agreed  with  the  official  determination. 


On  average,  the  pass/fail  decisions  generated  by  the  spectrophotometer  were  as 
accurate  as  the  pass/fail  decisions  generated  by  the  10  people.  In  other  words,  a 
spectrophotometer  perfonns  just  as  well  as  a  human  with  good  color  vision  when 
judging  the  color  fabric  shades. 
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Chapter  1 

Introduction 


The  purpose  of  our  study  was  to  assess  the  viability  of  using  spectrophotometers 
to  evaluate  shade  conformance  of  military  fabrics.  Spectrophotometers  are  in¬ 
struments  that  measure  color  in  tenns  of  light  reflected  or  absorbed  in  different 
portions  of  the  spectrum  and  in  different  amounts  and  provide  a  numerical  value 
specific  to  that  color.  Many  military  fabric  suppliers  currently  use  spectrophoto¬ 
meters  as  part  of  their  quality  control  process;  however,  current  shade  acceptance 
of  military  textile  materials  to  a  shade  standard  is  strictly  based  on  visual  assess¬ 
ment  methods.  Visual  evaluation  by  trained  shade  evaluators  is  widely  considered 
superior  to  instrumentation  because  the  human  eye  is  more  sensitive  to  shade  var¬ 
iation,  which  is  especially  important  in  the  evaluation  of  multicolored  camouflage 
patterns. 

The  feedback  from  visual  evaluations  that  is  provided  to  fabric  suppliers  can,  at 
times,  be  vague  and  not  provide  sufficient  infonnation  to  correct  the  shade  in  the 
proper  direction.  Without  more  useful  feedback,  the  fabric  suppliers  have  difficul¬ 
ty  making  the  proper  adjustments  to  their  printing  and  dying  processes  to  meet  the 
given  shade  standard  for  a  specific  fabric  in  future  production.  Spectrophotometer 
measurements  provide  more  quantitative  feedback  for  shade  evaluations,  but  con¬ 
cerns  regarding  the  consistency  of  spectrophotometer  measurements  have  raised 
questions  about  the  use  of  these  measurements  in  government  shade  evaluations 
and  the  feedback  they  provide  fabric  suppliers. 

The  study  addressed  two  main  questions: 

♦  Do  spectrophotometer  measurements  collected  at  different  locations  pro¬ 
duce  consistent  color  measurements? 

♦  Given  that  the  instruments  do  not  have  the  same  sensitivity  to  shade  as  the 
human  eye,  to  what  extent  do  human  visual  evaluations  agree  with  spec¬ 
trophotometer  evaluation? 

If  it  can  be  shown  that  evaluations  by  spectrophotometers  agree  with  the  results  of 
a  government  visual  evaluation  at  least  as  often  as  evaluations  by  other  human 
shade  evaluators,  a  plausible  case  will  have  been  presented  for  incorporating  the 
use  of  spectrophotometer  measurements  somewhere  in  the  military  shade  evalua¬ 
tion  and  feedback  process. 

In  addition  to  the  primary  study,  an  ancillary  study  was  conducted  to  collect 
information  about  the  physical  attributes  and  instrumentation  used  in  the  vari¬ 
ous  shade  rooms  of  government  and  industry  facilities  that  conduct  visual  and 
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instrumental  shade  evaluation.  The  purpose  was  to  gain  insight  on  the  amount 
of  variation  in  characteristics  and  instrumentation  used  throughout  the  industry. 


Standards 


The  spectrophotometer  measurements  of  interest  in  this  study  were  based  on  stan¬ 
dards  commonly  used  by  government  and  industry.  These  standards  are  based  on 
the  International  Commission  on  Illumination  (CIE,  or  Commission  Internatio¬ 
nale  d' Eclair  age),  which  specifies  colors  for  instrumental  shade  evaluations  using 
three  axes  (see  Figure  1-1): 

♦  L*  =  light-dark  axis.  Measured  on  a  scale  of  0  to  100,  where  0  is  at  the 
darkest  end  of  the  scale,  and  100  is  at  the  lightest. 

♦  a*  =  red-green  axis.  Scale  with  midpoint  of  0  and  no  upper  or  lower  limit. 
Negatives  scores  on  this  scale  are  more  green,  and  positive  scores  are 
more  red.  The  shades  included  in  this  report  vary  from  -26.64  to  9.26  on 
this  scale. 

♦  b*  =  yellow-blue  axis.  Scale  with  midpoint  of  0,  with  no  upper  or  lower 
limit.  Negative  scores  on  this  scale  are  more  blue,  and  positive  scores  are 
more  yellow.  The  shades  included  in  this  report  vary  from  -20.56  to  18.43 
on  this  scale. 

Collectively,  these  dimensions  are  referred  to  as  the  CIELAB  (  CIE  L*a*b*). 


Figure  1-1.  CIELAB  Coordinate  System 
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-b 
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Introduction 


During  the  course  of  the  study,  two  sets  of  measurements  were  collected  based  on 
the  CIELAB  standard.  The  first  set  comprised  absolute  measurements  along  each 
of  the  three  color  dimensions  in  the  color  space  designated  as  L*,  a*  and  b*.  An 
absolute  measure  defines  the  color  coordinates  as  a  single  point  in  the  three- 
dimensional  axis  for  the  sample  evaluated.  The  second  set  of  measurements  iden¬ 
tified  how  far  the  shade  of  a  sample  differed  from  the  shade  standard  for  that 
sample’s  fabric  type.  Designated  as  AL*,  Aa*,  and  Ab*,  this  set  of  measurements 
defines  a  vector  in  the  three-dimensional  axis  that  specifies  the  color  difference 
between  the  sample  and  the  designated  shade  standard  evaluated.  For  example, 
measurements  from  the  sample  designated  as  the  standard  for  the  color  shade 
Black  472  are 

♦  17.04  for  L*, 

♦  0.45  for  a*,  and 

♦  -1.17  for  b*. 

If  another  sample  of  Black  472  was  measured  at  16.62  for  L*,  0.44  for  a*,  and 
-0.96  for  b*,  then  its  values  would  be 

♦  -0.42  for  AL*,  or  16.62  -  17.04; 

♦  -0.01  for  Aa*,  or  0.44  -  0.45;  and 

♦  0.21  for  Ab*,  or -0.96 -(-1.17). 

Purpose 

The  purpose  of  the  study  was  a  twofold  assessment  of  whether  there  is  a  correla¬ 
tion  between  instrument  shade  measurements  collected  at  government  and  indus¬ 
try  sites,  and  whether  there  is  a  correlation  between  instrument  pass/fail  results 
and  human  visual  inspection  pass/fail  results  for  a  representative  set  of  military 
fabrics.  To  make  this  assessment,  three  hypotheses  were  tested: 

♦  Hypothesis  1 :  There  is  correlation  among  government  and  industry  shade 
instrumentation  in  measuring  the  absolute  value  of  a  color. 

♦  Hypothesis  2:  There  is  correlation  among  government  and  industry  shade 
instrumentation  in  measuring  the  color  differences  between  samples. 

♦  Hypothesis  3:  There  is  correlation  between  instrumental  measurement  and 
human  observer  in  making  pass/fail  acceptance  decisions. 
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The  first  two  hypotheses  deal  with  whether  instrumentation  at  different  locations 
produce  consistent  shade  measurements.  This  issue  is  addressed  in  this  report  by 
seeking  the  answers  to  the  following  two  questions: 

♦  To  what  extent  do  different  locations  produce  consistent  absolute  values  of 
L*,  a*,  and  b*  for  different  color  shade  samples? 

♦  To  what  extent  do  different  locations  produce  consistent  color  difference 
measurements  between  shade  samples  and  a  designated  standard  for  a 
given  fabric?  That  is,  to  what  extent  do  AL*,  Aa*,  and  Ab*  vary  among 
locations? 

The  third  hypothesis  deals  with  the  extent  to  which  spectrophotometer  evaluations 
agree  with  human  evaluations.  This  issue  is  addressed  by  seeking  the  answers  to 
the  following  two  additional  questions: 

♦  To  what  extent  do  decisions  made  by  human  evaluators  on  a  given  set  of 
fabric  samples  as  compared  to  a  shade  standards  (pass/fail  decisions)  agree 
with  one  another  and  with  the  official  pass/fail  determination? 

♦  To  what  extent  do  spectrophotometer  measurement  decisions  on  a  given 
set  of  fabric  samples  as  compared  to  the  shade  standard  agree  with 
pass/fail  decisions  by  human  evaluators  and  with  the  official  pass/fail 
determination? 

Study  Approach 


We  assembled  a  team  to  design  a  study  and  then  monitor  and  provide  feedback 
on  its  conduct.  The  study  team  comprised  representatives  from  the  following 
organizations: 

♦  Defense  Logistics  Agency  (DLA)  Headquarters  (HQ),  Customer-Driven 
Unifonn  Manufacturing  II  (CDUM  II)  Program 

♦  Defense  Supply  Center  Philadelphia  (DSCP)  Clothing  and  Textiles  (C&T) 
Quality 

♦  Defense  Supply  Center  Columbus  (DSCC)  Product  Testing  Center  (PTC) 

♦  Army  Natick  Soldier  Research,  Development,  and  Engineering  Center 
(RDEC) 

♦  Government/Industry  Shade  Integrated  Product  Team  (IPT) 

♦  Pennsylvania  Apparel 

♦  Brookwood  Companies 
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Introduction 


♦  Performance  Textiles 

♦  LMI. 

The  study  was  conducted  in  three  phases. 

Phase  1  Overview 

In  Phase  1,  CIELAB  readings  were  taken  from  five  sample  sets  of  tiles  and  fa¬ 
brics  using  on-site  instrumentation  at  three  textile  printing  and  finishing  plants,  a 
color  shade  instrument  manufacturer,  and  Army  Natick  Soldier  RDEC. 

Phase  1  had  three  objectives: 

♦  Evaluate  the  ability  to  collect  the  instrumentation  data  at  the  five  sites. 

♦  Detennine  if  the  data  collected  are  statistically  sufficient  for  assessing 
whether  spectrophotometers  used  in  different  locations  produce  consistent 
color  measurements,  modifying  the  data  collection  method  if  necessary. 

♦  Explore  the  potential  for  additional  data  collection. 

Phase  2  Overview 

In  Phase  2,  data  was  collected  at  19  facilities  (the  original  5  plus  14  additional 
sites)  using  the  modified  data  collection  method  test  plan  derived  during  Phase  1 
on  tile  and  fabric  set  samples.  The  goal  of  Phase  2  was  to  determine  whether  dif¬ 
ferent  locations  produce  consistent  color  measurements. 

Phase  3  Overview 

Phase  3  detennined  the  extent  that  pass/fail  decisions  based  on  spectrophotometer 
measurements  agree  with  the  official  pass/fail  determination  and  with  human  shade 
evaluators. 

Ten  sets  of  solid  military  fabric  samples  from  10  different  fabrics  were  evaluated 
against  their  corresponding  shade  standard.  Shade  evaluators  at  Anny  Natick  Sol¬ 
dier  RDEC  and  DSCC  PTC  provided  the  official  visual  pass/fail  detennination  for 
each  set  of  samples  when  evaluated  against  the  corresponding  shade  standard.  A 
comparison  was  then  made  between  the  pass/fail  detenninations  of  human  evalua¬ 
tors  and  the  pass/fail  detennination  generated  by  a  spectrophotometer.  The  offi¬ 
cial  pass/fail  determination  was  used  to  compare  the  accuracy  of  the  two  sets  of 
evaluations. 
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Report  Organization 


The  remainder  of  this  report  is  organized  as  follows: 

♦  Chapter  2  details  the  methodology,  procedures,  and  findings  of  Phase  1. 

♦  Chapter  3  details  the  methodology,  data  analysis,  and  findings  of  Phase  2. 

♦  Chapter  4  details  the  methodology,  data  analysis,  and  findings  of  Phase  3. 

♦  Appendix  A  reports  the  finding  of  an  ancillary  study  conducted  to  deter¬ 
mine  if  factors  other  than  instrumentation  were  causing  variability  in 
readings  of  fabric  shade  parameters. 

♦  Appendix  B  provides  the  data  collection  protocols  for  ceramic  tiles,  shade 
fabric  samples,  solid  shade  on  different  fabric  samples,  fabric  pairs,  and 
camouflage  fabric  samples. 

♦  Appendix  C  defines  abbreviations  in  the  report. 
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Chapter  2 

Phase  1 


During  Phase  1 ,  CIELAB  measurements  were  collected  using  the  data  collection 
protocol  that  was  developed  by  textile  technologists  from  Anny  Natick  Soldier 
RDEC  (provided  in  Appendix  B).  Anny  Natick  Soldier  RDEC  used  the  American 
Association  of  Textile  Chemists  and  Colorists  (AATCC)  Evaluation  Procedure  6, 
Instrumental  Color  Measurement,  Section  2,  “Measurement  of  Color  by  Reflec¬ 
tance  Methods,”  as  a  guideline  for  the  color  measurement  of  the  samples. 

Five  sites — representing  government,  industry,  and  a  spectrophotometer 
manufacturer — were  selected  as  data  collection  targets.  Instrumentation  measure¬ 
ments  were  collected  from  these  five  sites,  which  are  listed  in  Table  2-1. 

Table  2-1.  Phase  1  Data  Collection  Sites 


Name 

Location 

Description 

Army  Natick  Soldier  RDEC 

Natick,  MA 

Government  shade  evaluation  lab 

HunterLab 

Reston,  VA 

Instrumentation  vendor 

Kenyon  Industries 

Kenyon,  Rl 

Industry  lab 

Polartec  LLC 

Lawrence,  MA 

Industry  lab 

Duro  Mills 

Fall  River,  MA 

Industry  lab 

On-site  personnel  followed  protocol  procedures  to  capture  the  spectrophotometer 
measurements  of  the  samples  provided.  A  textile  technologist  from  Army  Natick 
Soldier  RDEC  was  present  at  each  site  to  ensure  the  spectrophotometers  were 
properly  calibrated  and  samples  were  properly  conditioned,  and  to  observe  the 
instrumentation  data  collection  for  each  set  of  tile  and  fabric  samples. 

Phase  1  Samples 


Spectrophotometer  measurements  were  collected  on  the  following  sample  sets  at 
each  site: 

♦  A  standardized  set  of  color  ceramic  tiles 

♦  A  set  of  different  shade  fabric  samples 

♦  A  solid  shade  set  of  different  fabrics 
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♦  A  set  of  fabric  pairs 

♦  A  set  of  Army  combat  uniform  (ACU)  and  Air  Force  airman  battle  uni¬ 
form  (ABU)  camouflage  samples. 


Table  2-2  lists  the  material  and  source  of  each  sample  measured  during  Phase  1. 

Table  2-2.  Phase  1  Study  Samples 


Type 

Material 

Description 

Source 

Standardized  color 

Ceramic 

Deep  Blue 

Natick 

tiles 

Ceramic 

Deep  Grey 

Natick 

Ceramic 

White 

Natick 

Ceramic 

Mid  Grey 

Natick 

Ceramic 

Green 

Natick 

Ceramic 

Pale  Grey 

Natick 

Single  shade  (Blue 

Polyester/wool  elastique 

Blue  450.4 

DSCC  PTC 

450.4) — different 
fabric  substrates 

Polyester 

Blue  450.4 

DSCC  PTC 

Polyester/wool  tropical 

Blue  450.4 

DSCC  PTC 

Polyester/wool  serge 

Blue  450.4 

DSCC  PTC 

Fabric  pairs 

NA 

Black  472 

DSCC  PTC 

NA 

Forest  Green  504 

DSCC  PTC 

NA 

Purple  3905 

DSCC  PTC 

NA 

CG  483 

DSCC  PTC 

Army  ACU 

Nylon/cotton  twill 

Army  ACU  A 

Natick 

camouflage  samples 

Nylon/cotton  twill 

Army  ACU  B 

Natick 

Nylon/cotton  twill 

Army  ACU  C 

Natick 

Nylon/cotton  twill 

Army  ACU  D 

Natick 

Nylon/cotton  twill 

Army  ACU  E 

Natick 

Nylon/cotton  twill 

Army  ACU  F 

Natick 

Air  Force  ABU 

Nylon/cotton  twill 

Air  Force  ABU  A 

DSCC  PTC 

camouflage  samples 

Nylon/cotton  twill 

Air  Force  ABU  B 

DSCC  PTC 

Nylon/cotton  twill 

Air  Force  ABU  C 

DSCC  PTC 

Nylon/cotton  twill 

Air  Force  ABU  D 

DSCC  PTC 

Nylon/cotton  twill 

Air  Force  ABU  E 

DSCC  PTC 

Nylon/cotton  twill 

Air  Force  ABU  F 

DSCC  PTC 

Note:  Natick  =  Army  Natick  Soldier  RDEC. 
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Phase  1 


Phase  1  Findings 


The  data  generated  from  Phase  1  showed  that  the  data  collection  methodology 
required  a  redesign.  Those  findings  were  presented  to  the  study  working  group 
on  October  29,  2008.  A  set  of  recommendations  was  put  forth  for  the  redesigned 
methodology  to  be  followed  in  Phase  2. 

Observations  from  Phase  1  Data 

♦  Only  basic  graphical  analysis  (line  graphs  of  averages  for  each  color  fabric 
and  tile  combination  by  site)  of  the  ACU  and  ABU  samples  was  possible 
with  the  existing  data. 

♦  Spectrophotometer  readings  appeared  to  vary  substantively  from  site 
to  site. 

♦  The  observations  of  any  given  color  (fabric  or  tile)  combination  were  too 
few  to  detennine  if  the  variation  in  measurements  was  site-specific  factors 
or  random  measurement  error.  More  than  30  observations  of  a  color-fabric 
or  tile  combination  are  desirable,  and  a  minimum  of  18  observations  was 
determined  to  be  necessary  to  be  able  to  sort  out  these  factors. 

♦  The  samples  of  color  (fabric  or  tiles)  were  too  many  to  test  sufficiently  for 
systematic  variation  in  any  given  sample  set. 

Recommendations 

♦  Increase  the  number  of  observations  of  a  color  (fabric  or  tile)  sample  to 
18  per  site. 

♦  Reduce  the  number  of  color  (fabric  or  tile)  combinations  to  a  core  set  of 
colors  that  are  commonly  used  by  the  armed  forces. 

♦  Use  graphical  analysis,  t-tests,  analysis  of  variance  (ANOVA),  and  regres¬ 
sion  testing  by  LMI  statistical  staff  to  examine  Hypotheses  1  and  2. 

♦  Consolidate  the  number  of  color  (fabric  or  tile)  combinations: 

>  L*,  a*,  and  b*  testing — six  tiles  and  three  solid  fabrics 

>  AL*,  Aa*,  and  Ab*  testing — two  sets  of  fabrics,  one  solid  and  one 
camouflage  pattern. 
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Chapter  3 

Phase  2 


Based  upon  the  findings  and  data  collection  during  Phase  1,  the  data  collection 
procedures  and  sample  sets  were  modified  for  Phase  2.  Although  the  procedures 
remained  relatively  unchanged,  the  number  of  readings  and  samples  increased. 

Eighteen  (rather  than  the  initial  three)  readings  were  taken  for  each  tile  and  solid 
fabric  sample  during  Phase  2.  The  number  of  solid  color  fabric  samples  was  also 
changed.  Due  to  data  collection  time  constraints  that  would  be  encountered  with 
the  additional  15  measurements  per  sample,  only  three  of  the  fabrics  examined 
during  Phase  1 1  were  selected  from  which  to  collect  instrument  measurements: 

♦  Coyote  Green  498  ♦  Desert  Sand  503  ♦  Army  Green  49 1 . 

The  data  collection  protocol  was  modified  to  collect  measurements  of  2  sample 
sets  with  19  fabric  samples  each.  The  data  collection  procedure  was  modified 
with  the  following  changes: 

♦  One  measurement  of  the  “standard”  and  one  measurement  of  each  of  the 
1 8  fabric  swatches  within  a  sample  set  were  taken. 

♦  The  fabric  used  in  the  sample  sets  were  Black  472  and  Air  Force  ABU 
Camouflage  (Grey). 

♦  The  standard  and  each  sample  were  measured  to  determine  the  values  of 
L*,  a*,  b*,  AL*,  Aa*,  and  Ab*. 

Phase  2  Samples 


Table  3-1  lists  the  material  and  source  of  each  sample  measured  during  Phase  2. 

Table  3-1.  Phase  2  Study  Samples 


Sample  # 

Type 

Description 

Source 

1 

Ceramic  tile 

Deep  Blue 

Natick 

2 

Ceramic  tile 

Deep  Grey 

Natick 

3 

Ceramic  tile 

White 

Natick 

4 

Ceramic  tile 

Mid  Grey 

Natick 

5 

Ceramic  tile 

Green 

Natick 

6 

Ceramic  tile 

Pale  Grey 

Natick 

1  Colors  such  as  Forest  Green  504,  Purple  3905,  CG483,  and  Blue  450  were  eliminated. 
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Table  3-1.  Phase  2  Study  Samples 


Sample  # 

Type 

Description 

Source 

7 

Solid  shade  fabric 

Coyote  Green  498 

DSCC  PTC 

8 

Solid  shade  fabric 

Army  Green  491 

DSCC  PTC 

9 

Solid  shade  fabric 

Desert  Sand  503 

DSCC  PTC 

10 

Black  472  fabric 

Black  472  “Standard” 

DSCC  PTC 

11-28 

Black  472  fabric 

Black  472  samples  1-18 

DSCC  PTC 

29 

Air  Force  ABU  Camouflage  (Grey) 

ABU  “Standard” 

DSCC  PTC 

30-47 

Air  Force  ABU  Camouflage  (Grey) 

ABU  samples  1-18 

DSCC  PTC 

Note:  Natick  =  Army  Natick  Soldier  RDEC. 


Phase  2  Data  Collection 


The  data  collected  at  the  19  tests  sites  are  listed  in  Table  3-2.  Those  19  sites  were  se¬ 
lected  as  representative  of  government,  spectrophotometer  makers,  and  the  industry 
that  produces  unifonn  fabric  for  the  U.S.  military.  With  19  sites,  there  was  sufficient 
variation  in  site-relevant  factors  to  study  systematic  inter-site  variation  and  site- 
specific  factors  (e.g.,  maintenance  of  spectrophotometer  and  calibration  settings). 

Table  3-2.  Phase  2  Data  Collection  Sites 


Name 

Location 

Description 

Army  Natick  Soldier  RDEC 

MA 

Government  shade  evaluation  lab 

DSCC  Product  Testing  Center 

PA 

Government  shade  evaluation  lab 

U.S.  Coast  Guard  Shade  Lab 

MA 

Government  shade  evaluation  lab 

U.S.  Navy  C&T  Research  Lab 

MA 

Government  shade  evaluation  lab 

Data  Color 

NJ 

Instrumentation  vendor 

HunterLab 

VA 

Instrumentation  vendor 

X  Rite 

NC 

Instrumentation  vendor 

Bondcote 

VA 

Industry  lab 

Brittany 

MA 

Industry  lab 

Burlington  Finishing  Plant 

NC 

Industry  lab 

Burlington  Raeford 

NC 

Industry  lab 

Carlisle 

SC 

Industry  lab 

Crystal  Springs 

GA 

Industry  lab 

Duro  Mills 

MA 

Industry  lab 

Kenyon  Industries 

Rl 

Industry  lab 

Milliken 

SC 

Industry  lab 

Mount  Vernon 

GA 

Industry  lab 

Polartec  LLC 

MA 

Industry  lab 

W.L.  Gore 

MD 

Industry  lab 
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Phase  2 


When  presenting  actual  data  or  analyzed  data  specific  to  a  site,  a  Greek  letter  was 
randomly  assigned  (Alpha  through  Zeta)  in  place  of  the  actual  site  name.  This  was 
done  to  preserve  the  anonymity  of  the  site  providing  the  data. 

Additional  Corollaries 

The  primary  purpose  of  Phase  2  was  to  detennine  if  CIELAB  scores  from  differ¬ 
ent  spectrophotometers  at  different  sites  were  correlated.  This  study  also  looked 
into  how  CIELAB  readings  varied  by  the  size  of  aperture  (large  or  small)  used  in 
capturing  spectrophotometer  measurements.  This  led  to  the  following  corollaries 
to  test  in  the  analysis: 

♦  Corollary  l.a — Spectrophotometers  produced  by  different  manufacturers 
will  produce  different  values  of  L*,  a*,  and  b*. 

♦  Corollary  1  .b — Spectrophotometers  produced  by  different  manufacturers 
will  produce  different  values  of  AL*,  Aa*,  and  Ab*. 

♦  Corollary  2. a — Spectrophotometers  with  different  lens  aperture  sizes  will 
produce  readings  of  L*,  a*,  and  b*  that  correlate  differently. 

♦  Corollary  2.b — Spectrophotometers  with  different  lens  aperture  sizes  will 
produce  values  of  AL*,  Aa*,  and  Ab  that  correlate  differently. 

To  again  preserve  the  anonymity  of  the  spectrophotometer  manufacturers,  each 
manufacturer  is  represented  by  a  letter  from  the  Hebrew  alphabet:  Aleph,  Beth, 
Gimel,  or  Daleth. 

Analytical  Methods 

We  analyzed  the  data  collected  from  the  19  sites  using  Microsoft  Excel  2003  and 
Statistical  Package  for  the  Social  Sciences  (SPSS)  version  17.  We  employed  a 
variety  of  statistical  analysis  methods  to  examine  the  two  hypotheses  and  four 
corollaries.  We  then  produced  tables  and  graphs  to  illustrate  the  variations  in 
mean  and  standard  deviation  for  sample  tiles  and  fabrics,  sites,  and  spectrophoto¬ 
meters.  We  also  generated  correlation  matrices  in  order  to  examine  the  strength 
and  patterns  of  correlations  for  CIELAB  L*  a*  and  b*  readings  between  test  sites. 
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Data  Analysis 

Hypothesis  1 

We  examine  Hypothesis  1  first: 

Hypothesis  1 :  There  is  correlation  among  government  and  industry  shade 
evaluation  instrumentation  values  of  military  fabric  for  the  absolute  value 
of  color  shades — L*,  a*,  and  b*. 

Central  Tendency  of  Absolute  Measures  (L*,  a*,  and  b*)  of  Solid  Color 
Tiles  and  Fabrics 

When  we  wish  to  summarize  any  distribution  of  numbers,  the  one  characteristic  of 
the  distribution  that  we  are  often  interested  in  is  its  central  tendency.  The  central 
tendency  of  a  distribution  is  its  central  or  most  characteristic  value.  The  most  wide¬ 
ly  used  measures  of  central  tendency  are  the  mean  (or  average  value),  the  median 
(or  the  middle  value),  and  the  mode  (the  value  that  occurs  most  often).  For  the  pur¬ 
poses  of  this  study,  we  used  the  mean  as  our  measure  of  central  tendency. 

Table  3-3  shows  the  mean  values  of  L*,  a*,  and  b*  for  each  type  of  material 
sample  obtained  from  18  of  the  19  sites.  One  site  is  not  included  here  because  we 
did  not  have  data  from  that  site  for  all  of  the  sample  sets. 

Table  3-3.  L*,  a*,  and  b*  by  Type  of  Material  Sampled 


Type  of  material  sampled 

L* 

a* 

b* 

Deep  Blue  Tile 

28.3981 

8.4949 

-19.3166 

Green  Tile 

55.7443 

-26.1381 

13.3189 

Pale  Grey  Tile 

83.7194 

-0.3753 

0.5739 

Mid  Grey  Tile 

59.9678 

-0.4884 

-0.1982 

Deep  Grey  Tile 

36.1034 

-0.1510 

0.2656 

White  Tile 

95.6810 

-0.2501 

2.6837 

Army  Green  491 

24.2242 

-3.6765 

1.5410 

Coyote  498 

41.1680 

5.4887 

17.3420 

Desert  Sand  503 

67.4196 

2.2124 

10.3663 

Note:  Each  number  is  an  average  of  324  observations — 18  observations  at  18  of  19  sites. 


Table  3-3  is  useful  because  it  shows  how  the  scores  vary  for  the  different  colors. 
For  example,  the  deep  blue  tile  has  the  largest  reading  on  the  blue  end  of  the  b* 
(yellow-blue)  scale,  exactly  as  we  would  expect.  Similarly,  the  white  tile  has  the 
largest  reading  on  the  L* (dark-light)  scale  with  a  score  near  the  largest  possible 
value  of  100. 
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Phase  2 


We  also  used  the  scores  in  Table  3-3  as  benchmarks  in  assessing  inter-site  disper¬ 
sion  of  color  measurements.  If  the  color  measurements  from  an  individual  site 
vary  significantly  from  the  mean  measurements  from  all  sites,  we  assessed  wheth¬ 
er  the  measurements  from  that  site  are  divergent. 

Dispersion  of  Absolute  Measures  (L*,  a*,  and  b*)  for  Solid  Color 
Tiles  and  Fabrics 

In  addition  to  central  tendency,  the  other  characteristic  of  a  distribution  we  are 
often  interested  in  is  dispersion.  In  addition  to  a  distribution’s  most  characteristic 
value,  we  also  wish  to  know  how  tightly  or  sparsely  the  other  values  are  spread 
around  that  characteristic  value.  This  gives  us  a  sense  of  how  the  other  numbers  in 
that  distribution  vary  from  that  characteristic  value.  In  the  present  study,  the  real 
issue  for  Phase  2  is  not  what  the  absolute  measurements  L*,  a*,  and  b*  are,  but  to 
what  extent  they  vary  from  site  to  site  and  even  within  sites  themselves.  Our  first 
step  was  to  examine  overall  variation  in  the  scores  to  see  if  some  types  of  samples 
had  more  variation  than  other  types  of  samples.  There  are  three  measures  of  dis¬ 
persion  that  we  use  in  this  study:  the  range,  the  standard  deviation,  and  a  compari¬ 
son  between  the  subgroup  mean  and  the  overall  mean  for  a  given  measurement.  In 
this  context,  the  subgroups  are  the  individual  test  sites. 

The  range  is  simply  the  distance  between  the  lowest  score  and  the  highest  score. 
The  range  is  useful  because  it  is  easy  to  understand  and  because  it  gives  a  sense  of 
how  much  total  variation  exists  in  a  sample.  The  disadvantage  of  the  range  is  that 
outliers  can  give  a  distorted  sense  of  how  much  variation  exists  in  a  sample.  To 
complement  the  range,  we  use  the  standard  deviation.  The  standard  deviation  is 
somewhat  complicated,"  but  the  idea  is  to  measure  how  much,  on  average,  each 
observation  varies  from  the  mean  of  the  sample.  The  standard  deviation  is  the 
most  widely  used  indicator  of  dispersion  among  statisticians. 

When  comparing  subgroup  means  to  overall  means,  we  conclude  that  there  is  dis¬ 
persion  among  subgroups  if  the  absolute  value  of  the  difference  between  these 
means  is  greater  than  some  given  value.  For  example,  if  the  overall  mean  is  30.3 
and  we  use  0.5  as  our  given  value,  we  might  conclude  that  a  subgroup  mean  was 
divergent  if  it  was  either  below  29.8  or  above  30.8. 

To  show  how  much  observations  vary  by  sample  type,  we  used  the  standard  devi¬ 
ation.  For  variation  within  a  site,  we  used  the  range  because  even  a  single  aberrant 
reading  from  a  given  site  is  important  and  should  be  noted.  For  inter-site  varia¬ 
tion,  we  used  the  difference  between  the  site  mean  and  the  mean  for  all  sites. 


2  The  formula  for  the  sample  standard  deviation  is  the  square  root  of  the  sum  of  the  squared 
deviations  from  the  mean  divided  by  the  number  of  observations  minus  one. 
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Figure  3-1  shows  standard  deviations  for  L*,  a*,  and  b*  by  type  of  color  sample.3 
Among  the  L*  measurements  Desert  Sand  503  has  the  highest  level  of  dispersion. 
Also,  note  that  the  three  color  samples  with  the  highest  measurements  of  L*  on 
Table  3-3  also  show  the  highest  level  of  dispersion  in  Figure  3-1.  It  appears  that 
colors  with  a  higher  absolute  value  of  L*  also  tend  to  have  more  divergent  L* 
measurements. 

The  a*  measurements  have  relatively  little  dispersion  overall.  We  found  the  most 
dispersion  in  a*  measurements  for  the  deep  blue  tile  and  the  green  tile.  These  tiles 
also  have  the  highest  and  lowest  a*  values  in  Table  3-3. 

Among  the  b*  measurements  the  white  and  pale  grey  tiles  had  the  highest  levels 
of  dispersion  in  their  measurements.  There  is  also  little  evidence  of  a  relationship 
between  the  absolute  values  of  b*  in  Table  3-3  and  the  dispersion  of  b*  measure¬ 
ments  shown  in  Figure  3-1,  with  the  possible  exception  of  the  deep  blue  tile.  The 
deep  blue  tile  has  the  lowest  b*  measurement  on  Table  3-3,  and  the  third-highest 
level  of  dispersion  for  b*  in  Figure  3-1. 

Figure  3-1.  Standard  Deviations  for  L*,  a*,  and  b*  by  Type  of  Sample 


491 


3  L*,  a*,  and  b*  are  shown  on  the  same  graph  only  for  convenience.  Since  the  scales  are  dif¬ 
ferent,  the  standard  deviations  across  scales  are  not  comparable. 
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Inter-Site  Dispersion  of  Absolute  Measures  (L*.  a*,  and  b*)  for  Solid  Color 
Tiles  and  Fabrics 

Table  3-4  gives  summary  results  of  cross-site  dispersion  for  the  six  color  tile 
samples  and  the  three  solid  color  fabric  samples  in  this  study  for  which  values  of 
L*,  a*  and  b*  are  available.  Keep  in  mind  the  actual  site  name  has  been  replaced 
with  a  random  letter  of  the  Greek  alphabet  to  maintain  anonymity.  Table  3-4  indi¬ 
cates  which  color  samples  at  each  site  had  readings  that  were  more  than  0.5  above 
or  below  the  mean  values  of  readings  recorded  from  all  sites. 

Table  3-4.  Color  Samples  with  L*,  a*,  and  b*  Readings  at  an  Individual  Site 
More  Than  0.5  Above  or  Below  Mean  for  All  Sites 


Site 

L* 

a* 

b* 

Alpha 

Desert  Sand  503 

Beta 

Chi 

Army  Green  491 

Delta 

Epsilon 

lota 

Kappa 

Lambda 

Mu 

Desert  Sand  503 

Omega 

Desert  Sand  503 

Deep  Blue 

Deep  Blue  Tile 

Deep  Blue  Tile 

Omicron 

Coyote  498 

Desert  Sand  503 

Psi 

Desert  Sand  503 

Rho 

All  Samples 

All  Samples 

Sigma 

Tau 

Desert  Sand  503 

Deep  Grey  Tile 

Deep  Blue  Tile 

Deep  Blue  Tile 

Coyote  498 

Deep  Grey  Tile 

Deep  Blue  Tile 

Theta 

Upsilon 

Desert  Sand  503 
Coyote  498 

Green  Tile 

Coyote  498 

Deep  Blue  Tile 

Green  Tile 

Xi 

Zeta 
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Several  things  stand  out  in  Table  3-4: 


♦  L*  and  b*  measurements  from  site  Rho  differ  significantly  from  all  other 
sites  for  all  color  samples  in  the  study.  No  other  site  had  divergent  mea¬ 
surements  for  more  than  three  colors;  site  Rho  had  divergent  L*  and  b* 
measurements  for  all  nine  color  samples. 

♦  There  is  less  evidence  of  dispersion  overall  for  a*  measurements  than  for 
L*  and  b*  measurements. 

♦  Some  color  samples  showed  evidence  of  more  dispersion  than  others.  Just 
as  we  saw  in  Figure  3-1,  Desert  Sand  503  in  particular  appears  to  have  a 
great  deal  of  dispersion  in  its  L*  readings.  Coyote  498  and  the  deep  blue 
tile  also  show  more  dispersion  than  the  other  color  samples  throughout  all 
of  the  color  dimensions.  Overall,  there  appears  to  be  less  dispersion  on 
readings  of  color  tiles  than  fabrics. 

Within-Site  Dispersion  of  Absolute  Measures  (L*,  a*,  and  b*) 

Table  3-5  gives  summary  results  of  within-site  dispersion  by  site  for  the  six  color 
tile  samples  and  the  three  solid  color  fabric  samples  in  this  study  for  which  values 
of  L*,  a*  and  b*  are  available. 

Table  3-5.  Sites  with  L*,  a*,  and  b*  Readings  with  a  Range  0.3  or  Larger 


Site 

L* 

a* 

b* 

Alpha 

Desert  Sand  503 

Beta 

Chi 

Delta 

Desert  Sand  503 

Epsilon 

Army  Green  491 

lota 

Kappa 

Lambda 

Desert  Sand  503 

Mu 

Omega 

Omicron 

Desert  Sand  503 

Psi 

Rho 

Sigma 

Green  Tile 

Tau 

Deep  Blue  Tile 

Theta 

Upsilon 

Xi 

Zeta 
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The  table  indicates  which  site  reported  readings  of  a  given  sample  type  that  had  a 
range  of  0.3  or  larger.  Recall  that  each  site  reported  18  different  L*,  a*,  and  b* 
readings  for  the  6  color  tile  samples  and  the  3  solid  color  fabric  samples.  The 
range  is  the  distance  between  the  largest  and  the  smallest  of  the  18  readings. 
Overall,  we  find  very  little  inter-site  dispersion  for  these  samples.  The  one  major 
exception  is  b*  readings  of  Desert  Sand  503;  four  different  sites  reported  sets  of 
readings  with  a  range  of  0.3  or  greater.  This  is  somewhat  surprising  given  that  we 
did  not  observe  an  unusually  large  standard  deviation  for  b*  readings  of  Desert 
Sand  503  in  Figure  1-1.  This  points  to  the  possibility  that  the  dispersion  in  the  b* 
readings  of  Desert  Sand  503  is  caused  by  a  few  outliers.  (We  revisit  this  issue  at 
the  end  of  this  chapter.) 

Summary  of  Central  T endency  and  Dispersion  of  Absolute  Measures 
(L*,  a*,  and  b*) 

Three  main  points  should  be  taken  from  this  section  of  our  analysis: 

♦  In  general,  spectrophotometers  at  the  19  sites  generated  similar  absolute 
measures  (L*,  a*,  and  b*)  for  the  color  tile  and  fabric  samples  included  in 
this  study.  The  one  exception  was  site  Rho,  which  consistently  produced 
L*  and  b*  readings  that  diverged  significantly  from  the  readings  produced 
at  the  other  18  sites.  We  therefore  find  that  hypothesis  1  is,  for  the  most 
part,  confirmed. 

♦  With  one  major  exception,  there  was  relatively  little  within-site  dispersion 
of  the  absolute  measures  (L*,  a*  and  b*).  That  exception  was  the  Desert 
Sand  503  fabric.  Four  sites  reported  divergent  sets  of  b*  readings  for  that 
fabric.  With  this  one  exception,  we  have  high  confidence  that  the  sites  are 
generating  consistent  readings. 

Correlations  of  Absolute  Measures  (L*,  a*,  and  b*)  for  Solid  Color 
Tiles  and  Fabrics 

Having  examined  central  tendency  and  dispersion  in  the  absolute  measures 
(L*,  a*,  and  b*),  we  next  examined  to  what  extent  these  measures  are  correlated 
with  one  another.  The  correlation  coefficient  “r”  indicates  to  what  extent  two  sets 
of  observations  are  related  to  one  another.  This  coefficient  has  a  range  of  possible 
values  from  -1  to  1.  The  standard  that  we  will  use  in  this  study  to  interpret  the 
correlation  coefficient  is  as  follows:4 

♦  -1.0  to  -0.5,  strong  negative  association 

♦  -0.49  to  -0.3,  moderate  negative  association 

♦  -0.29  to  -0.1,  small  negative  association 

4  Jessica  Steele,  Choosing  the  Correct  Statistical  Test,  www.radford.edu/~jcsteele/Choosing 
the  Correct  Statistical  Test,  2006. 
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♦  -0.09  to  0.09,  no  association 

♦  0.1  to  0.29,  small  positive  association 

♦  0.3  to  0.49,  weak  positive  association 

♦  0.5  to  1.0  strong  positive  association. 

Figure  3-2,  Figure  3-3,  and  Figure  3-4  show  the  correlations  matrices  for  the 
mean  site  L*,  a*,  and  b*  tile  measurements.  All  three  of  the  matrices  indicate  an 
extremely  high  degree  of  correlation  among  the  mean  measurements  from  all 
sites.  This  means  that  even  if  there  are  small  differences  in  reported  measure¬ 
ments  between  the  sites,  these  differences  remain  highly  stable  across  color  tile 
samples. 

An  illustration  of  the  high  degree  of  correlation  across  color  tile  samples  can  be 
seen  in  Figure  3-5  by  examining  the  graph  of  the  mean  L*  color  tile  measure¬ 
ments  from  sites  Alpha  and  Rho.  Recall  from  Table  3-4,  which  showed  that  L* 
measurements  from  site  Rho  were  consistently  different  from  measurements  from 
other  sites  across  color  sample  type.  Despite  the  discrepancy,  there  is  still  a  very 
close  correspondence  between  the  mean  L*  measurements  from  these  sites. 

Figure  3-6,  Figure  3-7,  and  Figure  3-8  show  the  correlation  matrices  for  the  mean 
L*,  a*,  and  b*  site  measurements  for  the  color  fabric  samples  Anny  Green  491, 
Coyote  498,  and  Desert  Sand  503.  Once  again,  we  see  an  extremely  high  degree  of 
correlation  for  all  these  measurements  across  sites.  We  can  conclude  this  section  of 
our  analysis  by  stating  that  despite  a  limited  amount  of  across-site  and  within-site 
dispersion  in  absolute  L*,  a*,  and  b*  of  six  color  tile  samples  and  three  fabric  sam¬ 
ples,  we  still  find  a  high  degree  of  correlation  across  sites  for  these  measurements. 

Hypothesis  1 — Findings 

We  find  there  is  a  high  degree  of  correlation  among  industry  color  shade  evalua¬ 
tion  instrumentation  values  of  military  fabric  for  the  absolute  value  of  color 
shades  L*,  a*,  and  b*,  as  shown  in  Figures  3-2-3-8. 
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Figure  3-2.  Correlation  Matrix  for  L*  Site  Means  of  Color  Tiles 


Strong  Associations  (>.5) 

Weak  Associations 

.3  to  .5) 

Small  Assocations  (.1  to  .3) 

171 

0 

0 

Median  Correlation 

Mean  Correlation 

0.999996 

0.999992 

□  small  □  weak  ^  strong  association 

Figure  3-3.  Correlation  Matrix  for  a*  Means  of  Color  Tiles 


Strong  Associations  (>.5) 

Weak  Associations 

.3  to  .5) 

Small  Assocations  (.1  to  .3) 

171 

0 

0 

Median  Correlation 

Mean  Correlation 

0.99998 

0.999934 

□  small  □  weak  m  strong  association 
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Figure  3-4.  Correlation  Matrix  for  b*  Means  of  Color  Tiles 


Strong  Associations  (>.5) 

Weak  Associations 

.3  to  .5) 

Small  Assocations  (.1  to  .3) 

171 

0 

0 

Median  Correlation 

Mean  Correlation 

0.999942 

0.999713 

□  small  □  weak  ^  strong  association 


Figure  3-5.  L*  Means  for  Color  Tile  Samples  from  Sites  Alpha  and  Rho 
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Figure  3-6.  Correlation  Matrix  for  L*  Means  of  Color  Fabrics 


Strong  Associations  (>.5) 

Weak  Associations 

.3  to  .5) 

Small  Assocations  (.1  to  .3) 

153 

0 

0 

Median  Correlation 

Mean  Correlation 

0.999996 

0.999991 

□ sma|1  □  weak  m  strong  association 


Figure  3-7.  Correlation  Matrix  for  a*  Means  of  Color  Fabrics 


Strong  Associations  (>.5) 

Weak  Associations 

.3  to  .5) 

Small  Assocations  (.1  to  .3) 

153 

0 

0 

Median  Correlation 

Mean  Correlation 

0.999948 

0.999888 

□  small  □  weak  m  strong  association 
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Figure  3-8.  Correlation  Matrix  for  b*  Means  of  Color  Fabrics 


Strong  Associations  (>.5) 

Weak  Associations 

.3  to  .5) 

Small  Assocations  (.1  to  .3) 

153 

0 

0 

Median  Correlation 

Mean  Correlation 

0.999712 

0.999267 

□ sma|1  □  weak  m  strong  association 


Hypothesis  2 

It  is  now  appropriate  to  examine  Hypothesis  2: 

Hypothesis  2\  There  is  correlation  among  government  and  industry  color 
shade  evaluation  instrumentation  measurements  of  the  difference  of  color 
shade  values — AL*,  Aa*,  and  Ab* — of  different  military  fabric  samples  in 
comparison  to  a  “standard”  sample. 

In  order  to  test  Hypothesis  2,  we  conducted  additional  analysis  using  the  Air  Force 
Grey  shade  of  the  Air  Force  ABU  camouflage  pattern  and  Black  472  solid  shade. 
The  data  for  these  sample  sets  were  collected  in  a  different  manner.  Two  data  sets 
(one  for  each  fabric  type)  consisted  of  a  single  “standard”  fabric  sample  and  1 8  oth¬ 
er  samples.  Rather  than  taking  18  measurements  of  each  fabric  sample,  we  col¬ 
lected  a  single  measurement  for  each  of  the  19  fabric  samples  contained  in  the  two 
sample  sets.  In  addition,  since  the  grey  shade  of  the  ABU  camouflage  pattern  did 
not  allow  for  the  use  of  a  large  aperture  view  in  collecting  its  instrumentation  mea¬ 
surements,  all  sites  collected  their  ABU  grey  measurements  using  a  small  aperture 
opening.  Replicating  the  statistical  analyses  perfonned  for  the  nine  samples  above 
could  help  us  understand  what  a  “normal”  set  of  data  might  look  like  and  the  impli¬ 
cations  that  would  have  for  Hypothesis  1 . 

Mean  Values  of  Absolute  Measures  (L*,  a*,  and  b*)  and  Difference 
Measures  (AL*,  Aa*,  and  Ab*)  for  Air  Force  Grey  and  Black  472 

Table  3-6  gives  the  means  and  standard  deviations  of  the  absolute  measures 
(L*,  a*,  and  b*)  and  the  difference  measures  (AL*,  Aa*,  and  Ab)  for  the  samples 
of  Air  Force  Grey  and  Black  472  used  in  this  study.  Notice  the  standard  devia¬ 
tions  for  the  measurements  of  the  Black  472  samples  are  consistently  smaller  than 
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the  standard  deviations  for  the  measurements  of  the  Air  Force  Grey  samples.  This 
indicates  there  is  more  variation  among  the  Air  Force  Grey  samples  than  there  is 
among  the  Black  472  samples.  For  this  reason  alone,  differences  among  the  Air 
Force  Grey  samples  should  be  easier  to  detect  than  differences  among  the  Black 
472  samples.  The  Aa*  scores  for  the  Black  472  samples  appear  to  be  particularly 
problematic.  Table  3-6  indicates  that  the  average  difference  between  the  Aa*  des¬ 
ignated  as  the  standard  and  all  of  the  other  Black  472  samples  is  0.001  with  a 
standard  deviation  of  only  0.086.  With  differences  this  small,  we  may  run  into 
problems  finding  correlations  between  Aa*  scores  among  the  different  sites.  Cor¬ 
relation  essentially  means  that  one  set  of  scores  tends  to  change  in  direct  propor¬ 
tion  to  the  change  in  a  second  set  of  scores.  If  a  sample  does  not  exhibit  a 
sufficient  amount  of  variation,  then  it  becomes  more  difficult  to  demonstrate  cor¬ 
relation  in  that  sample. 

Table  3-6.  L*,  a*,  b*,  AL*,  Aa*,  and  Ab*  Means  and  Standard  Deviations 
for  Air  Force  Grey  and  Black  472 


Material  sampled  and  statistic 

L* 

a* 

b* 

AL* 

Aa* 

Ab* 

Air  Force  Grey  Mean 

50.27 

0.15 

5.75 

0.234 

0.234 

0.694 

Air  Force  Grey  Standard  Deviation 

1.45 

0.44 

0.75 

1.414 

0.455 

0.599 

Black  472  Mean 

16.36 

0.36 

-1 .51 

-0.5273 

0.001 

0.119 

Black  472  Standard  Deviation 

0.52 

0.12 

0.51 

0.484 

0.086 

0.163 

Table  3-7  displays  mean  AL*,  Aa*,  and  Ab*  scores  for  the  Air  Force  Grey  and 
Black  472  samples  by  site.  Means  shaded  in  yellow  were  more  than  0.5  above  or 
below  the  mean  difference  scores  for  all  sites.  Means  shaded  in  red  were  more  than 
1  above  or  below  the  mean  difference  scores  for  all  sites.  Mean  site  AL*  scores  for 
Air  Force  Grey  samples  were  more  likely  than  other  measure  to  diverge  from  the 
mean  scores  of  other  sites.  This  is  consistent  with  Table  3-6,  where  the  AL*  scores 
for  Air  Force  Grey  had  the  highest  standard  deviation.  Two  sites  also  reported  di¬ 
vergent  Ab*  scores  for  Air  Force  Grey.  This  is  again  consistent  with  Table  3-6, 
where  Ab*  scores  for  Air  Force  Grey  had  the  second  largest  standard  deviation. 

The  AL*  and  Ab*  scores  from  two  sites  in  particular,  Omega  and  Omicron,  were 
much  more  divergent  from  the  mean  difference  scores  reported  by  other  sites. 
Recall  from  Table  3-4  that  these  two  sites  had  some  divergent  values  for  some 
of  the  absolute  measures  (L*,  a*,  and  b*)  of  the  solid  color  samples  as  well. 
Surprisingly,  AL*,  Aa*,  and  Ab*  scores  from  site  Rho  differed  little  from  the 
AL*,  Aa*,  and  Ab*  reported  from  other  sites,  despite  the  fact  that  absolute 
measures  L*,  a*,  and  b*  for  solid  color  samples  from  site  Rho  had  shown  a 
large  degree  of  divergence. 
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Table  3-7.  Difference  Scores  for  Air  Force  Grey  and  Black  472 


Air  Force  Grey 

Site 

AL* 

Aa* 

Ab* 

Alpha 

0.03 

0.29 

0.56 

Beta 

-0.39 

0.23 

0.45 

Chi 

-0.46 

0.08 

0.45 

Delta 

-0.39 

-0.03 

0.54 

Epsilon 

0.04 

-0.17 

0.77 

lota 

0.06 

0.31 

0.63 

Kappa 

0.37 

0.26 

0.63 

Lambda 

0.13 

0.86 

Mu 

0.03 

0.21 

0.60 

Omega 

0.11 

1.44 

Omicron 

0.72 

1.45 

Psi 

0.01 

0.26 

0.52 

Rho 

0.00 

0.28 

0.61 

Sigma 

0.07 

0.23 

0.56 

Tau 

-0.05 

0.23 

0.56 

Theta 

0.04 

0.28 

0.61 

Upsilon 

0.02 

0.39 

0.69 

Xi 

-0.03 

0.32 

0.58 

Zeta 

0.16 

0.33 

0.69 

Mean 

0.23 

0.23 

0.69 

Black  472 

AL* 

Aa* 

Ab* 

-0.47 

-0.02 

0.14 

-0.66 

0.04 

0.06 

-0.77 

0.05 

0.03 

-0.37 

0.02 

0.22 

-0.53 

0.03 

0.20 

-0.64 

0.04 

0.06 

-0.28 

-0.05 

0.13 

-0.62 

0.03 

0.02 

-0.82 

0.00 

0.16 

-0.02 

0.02 

0.08 

-0.36 

-0.03 

0.14 

-0.75 

0.06 

0.13 

-0.59 

-0.01 

0.12 

-0.55 

-0.08 

0.18 

-0.52 

-0.01 

0.17 

-0.66 

0.01 

0.12 

-0.20 

-0.07 

0.09 

-0.83 

-0.01 

0.10 

-0.52 

-0.03 

0.12 

-0.53 

0.00 

0.12 

Note:  Each  number  is  an  average  of  18  observations  per  site,  except  Upsilon,  which 
has  1 7  observations.  Values  shaded  in  red  are  more  than  1 .0  above  or  below  the  average 
of  other  sites.  Values  shaded  in  yellow  are  more  than  0.5  above  or  below  the  average  of 
other  sites. 

Correlations  of  Difference  Measures  (AL*,  Aa*,  and  Ab*)  for  Air 
Force  Camouflage  Grey  and  Solid  Black  472  Samples 

Figure  3-9,  Figure  3-10,  and  Figure  3-11  display  the  correlation  matrices  for  the 
difference  measures  AL*,  Aa*,  and  Ab*  of  the  Air  Force  Grey  samples.  Just  as  in 
the  correlation  matrices  for  the  solid  color  tiles  and  samples,  the  measurements 
from  the  various  sites  of  the  Air  Force  Grey  samples  tended  to  be  highly  corre¬ 
lated  with  one  another. 
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While  the  correlations  represented  in  Figure  3-9,  Figure  3-10,  and  Figure  3-11  are 
nearly  all  strong,  they  are  not  nearly  as  strong  as  the  correlations  for  the  measure¬ 
ments  of  the  solid  color  tiles  and  fabrics  shown  earlier.  This  is  due  to  the  differing 
research  design  used  to  gather  shade  measurement  differences  versus  absolute  shade 
measurements.5 

One  thing  that  we  do  not  notice  in  Figure  3-9,  Figure  3-10,  or  Figure  3-11  are  low 
correlations  between  difference  measures  AL*,  Aa*,  and  Ab*  reported  from  sites 
Omega  and  Omicron  and  the  difference  measures  reported  from  the  other  sites. 
Even  though  AL*  and  Ab*  scores  of  Air  Force  Grey  samples  from  sites  Omega 
and  Omicron  diverged  from  the  scores  of  the  rest  of  the  sites,  changes  in  the 
scores  from  sites  Omicron  and  Omega  still  tended  to  track  closely  with  the 
changes  reported  by  other  sites. 

Figure  3-9.  Correlation  Matrix  of  AL*  Scores  for  Air  Force  Grey 


Strong  Associations  (>.5) 

Weak  Associations 

.3  to  .5) 

Small  Assocations  (.1  to  .3) 

171 

0 

0 

Median  Correlation 

Mean  Correlation 

0.968225 

0.939517 

□  small  □  weak  ^  strong  association 


5  Absolute  shade  measurement  data  collection  of  the  solid  color  tiles  and  fabrics  captured  1 8 
measurements  each  of  a  single  sample  from  each  solid  color  type.  Because  we  took  so  many  mea¬ 
surements  of  each  sample  type,  any  within-site  variation  of  measurements  was  most  likely  going  to 
be  averaged  out.  Shade  difference  measurement  data  collection,  on  the  other  hand,  used  1  standard 
sample  and  18  other  samples  for  each  of  the  sample  types  that  were  only  measured  once.  This  means 
divergent  individual  measurements  would  not  be  averaged  out.  Also,  since  each  site  was  measuring  a 
total  of  19  different  samples,  just  1  divergent  measurement  out  of  19  could  have  a  significant  effect 
on  the  correlation  of  that  site’s  measurements  with  those  of  other  sites.  Given  this  increased  potential 
for  error  in  the  difference  measures,  we  should  not  expect  the  near-perfect  correlations  of  around 
0.999  that  we  saw  in  shade  absolute  measurements  of  the  solid-color  samples. 
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Figure  3-10.  Correlation  Matrix  of  Aa*  Scores  for  Air  Force  Grey 


Strong  Associations  (>.5) 

Weak  Associations 

.3  to  .5) 

Small  Assocations  (.1  to  .3) 

167 

2 

2 

Median  Correlation 

Mean  Correlation 

0.960632 

0.838751 

□  small  □  weak  ^  strong  association 


Figure  3-11.  Correlation  Matrix  of  Ab*  Scores  for  Air  Force  Grey 


Strong  Associations  (>.5) 

Weak  Associations 

.3  to  .5) 

Small  Assocations  (.1  to  .3) 

170 

1 

0 

Median  Correlation 

Mean  Correlation 

0.960729 

0.886589 

□  small  □  weak  ^  strong  association 


Figure  3-12  shows  the  correlation  matrix  for  the  AL*  scores  reported  by  the  sites 
for  the  samples  of  the  solid  Black  472  fabrics.  All  of  these  correlations  are  strong 
with  the  exception  of  the  correlation  between  AL*  scores  from  site  Omicron  and 
AL*  scores  from  the  other  sites.  This  is  surprising  since  the  mean  AL*  score  from 
site  Omicron  shown  in  Table  3-7  was  not  significantly  different  from  the  mean 
AL*  score  reported  from  the  other  sites. 
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Figure  3-12.  Correlation  Matrix  for  AL*  Scores  of  Black  472 


Strong  Associations  (>.5) 

Weak  Associations 

.3  to  .5) 

Small  Assocations  (.1  to  .3) 

153 

14 

3 

Median  Correlation 

Mean  Correlation 

0.895097 

0.831739 

□  small  □  weak  ^  strong  association 


The  reason  for  the  low  correlations  for  site  Omicron  can  be  seen  in  Table  3-8. 

There  was  one  aberrant  measurement  from  site  Omicron  that  had  a  disproportionate 
effect  on  the  correlations  between  the  L*  scores  from  site  Omicron  and  the  Black 
472  L*  scores  from  the  other  sites.  This  measurement  of  21.18  is  circled  in  Table 
3-8.  If  this  single  measurement  is  replaced  by  the  average  measurement  of  that 
sample  from  the  other  sites,  the  recalculated  correlations  between  the  Black  472  L* 
scores  from  Omicron  and  those  from  the  other  sites  all  become  strong. 


Table  3-8.  L*  Measurements  of  Black  472  Samples 


Alpha  1 

Beta  1 

Chi  I 

Delta  | 

Epsilon 

lota  | 

Kappa  1 

Lambda 

Mu  1 

Omega 

Omicron 

psi  I 

Rho  | 

Sigma 

1  Tau  1 

Theta  1 

Upsilon  | 

X  1 

Zeta  1 

Standard 

16.84 

17.01 

17.24 

16.38 

16.67 

17  1 

16.51 

18.11 

17.29 

16.73 

17.04 

17.01 

16.32  | 

17.15 

16.73  | 

17.02 

16.55 

17.77  | 

16.76 

Sample  1 

16.25 

16.36 

16.64 

15.93 

15.98 

16.39 

16.13 

17.57 

16.67 

16.67 

16.37 

16.35 

15.63 

16.8 

16.5 

16.45 

16.12 

16.96 

16.19 

Sample  2 

16.49 

16.45 

16.35 

16.03 

16.28 

16.51 

16.40 

17.65 

16.61 

16.64 

16.55 

16.44 

15.82 

16.76 

16.37 

16.45 

16.39 

17.08 

16.39 

Sample  3 

15.94 

16.02 

16.08 

15.59 

15.66 

16.06 

15.74 

17.18 

16.15 

16.36 

15.94 

15.93 

15.54 

16.24 

16.02 

16.07 

15.84 

16.67 

15.83 

Sample  4 

16.28 

16.31 

16.25 

15.93 

16.09 

16.24 

16.23 

17.34 

16.19 

16.97 

16.36 

16.07 

15.74 

16.49 

15.91 

16.32 

16.32 

16.81 

16.07 

Sample  5 

16.43 

16.36 

16.50 

15.98 

15.9 

16.33 

16.26 

17.61 

16.36 

16.78 

16.55 

16.29 

15.81 

16.68 

16.21 

16.36 

16.42 

16.99 

16.26 

Sample  6 

16.55 

16.46 

16.78 

16.24 

16.22 

16.61  j 

16.50 

17.67 

16.64 

16.94 

16.77 

16.49 

15.97 

16.75 

16.35 

16.67 

16.66 

17.14 

16.45 

Sample  7 

16.05 

16.10 

15.90 

15.50 

15.78 

15.98 

15.93 

16.99 

16.02 

16.12 

16.03 

15.9 

15.45 

16.21 

15.36 

15.92 

16.01 

16.63 

15.87 

Sample  8  16.04  16.08  16.22  15.74  I  15.69  I  15.95  15.89  17.13  I  16.19  I  16.23  16.13  15.94  I  15.40  I  16.31  I  15.79  I  16.02  I  16.03  16.61  16 

Sample  9  15.95  15.84  15.89  15.56  15.72  15.79~T  15.75  16.97  15.95  16.32  15.85  |  15.74  15.16  |  15.99  15.47  15.82  15.88  16.39  15.79 


Sample  10 

16.43 

16.26 

16.45 

16.08 

16.2 

16.28 

16.36 

17.48 

16.38 

17.21 

16.39 

16.34 

15.78 

16.55 

16.23 

16.34 

16.38 

16.98 

16.29 

Sample  11 

16.54 

16.47 

16.42 

16.23 

16.33 

16.27 

16.28 

17.47 

16.39 

16.82 

16.42 

16.22 

15.79 

16.55 

16.18 

16.33 

16.40 

16.92 

16.25 

Sample  12 

16.88 

16.79 

17.00 

16.56 

16.68 

16.79 

16.80 

18 

16.75 

18.24 

16.85 

16.66 

16.19 

16.98 

16.89 

16.76 

16.88 

17.43 

16.76 

Sample  13 

16.06 

16.22 

16.32 

15.90 

16.02 

16.16 

16.03 

17.19 

16.23 

16.22 

16.13 

15.97 

15.50 

16.31 

16.23 

16.10 

16.15 

16.7 

15.94 

Sample  14 

17.61 

17.33 

17.09 

17.14 

17.34 

17.01 

16.82 

18.32 

17.14 

17.34 

17.07 

16.94 

16.72 

17.25 

16.94 

16.93 

17.66 

17.57 

17.04 

Sample  15 

16.28 

16.48 

16.51 

16.09 

16.2 

16.52 

16.25 

17.76 

16.71 

16.80 

16.66 

16.33 

15.75 

16.81 

16.36 

16.60 

16.34 

16.93 

16.44 

Sample  16 
Sample  17 

16.13 

16.40 

16.01 

16.37 

16.61 

16.78 

15.71 

16.22 

16.35 

15.75 

16.42 

16.63 

16.00 

16.40 

17.45 

17.65 

16.56 

16.78 

16.08 

16.55( 

21.18 

^  16.2 
J6.43 

15.48 

15.86 

16.67 

16.73 

16.29 

16.54 

16.40 

16.62 

16.07 

16.46 

17.02 

17.15 

16.05 

16.44 

Sample  18 

16.39 

16.36 

16.67 

15.83 

16.06 

16.48 

16.39 

17.43 

16.72 

16.54  V 

— 16.62  ^ 

1^16.38 

15.68 

16.77 

16.15 

16.45 

16.34 

16.99 

16.2 

Figure  3-13  shows  the  correlation  matrix  for  the  Aa*  scores  reported  by  the  sites  for 
the  samples  of  the  Black  472  fabrics.  Unlike  all  of  the  other  correlation  matrices 
that  we  have  seen  up  to  this  point,  this  correlation  matrix  shows  a  large  number  of 
correlations  that  are  weak,  small,  or  even  nonexistent.  Furthennore,  even  the  strong 
correlations  in  Figure  3-13  are  not  generally  as  strong  as  the  correlations  we  saw  in 
Figure  3-12  for  the  AL*  scores  of  the  black  472  samples.  The  mean  and  median 
correlations  for  the  Aa*  scores  are  only  0.553  and  0.5 18,  compared  to  0.895  and 
0.832  for  the  AL*  scores.  A  likely  cause  for  the  correlations  could  be  the  relative 
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lack  of  variation  of  the  Aa*  scores  of  the  Black  472  fabric  samples.  Recall  from 
Table  3-6  that  the  Aa*  varied  within  a  very  narrow  range  around  a  mean  of  0.001.  It 
appears  the  range  of  variation  of  the  Aa*  scores  is  so  narrow  as  to  be  close  to  the 
limits  of  what  the  spectrophotometers  can  detect.  Confirmation  of  this  hypothesis 
can  be  seen  in  Table  3-9.  The  sites  in  Figure  3-13  that  have  the  lowest  correlations 
also  tend  to  have  the  highest  standard  deviations  in  Table  3-9.  It  appears  that,  at 
least  for  sites  Epsilon  and  Lambda,  it  is  difficult  to  distinguish  actual  variation  in 
the  measurements  from  measurement  error. 

Figure  3-13.  Correlation  Matrix  for  Aa*  Scores  for  Black  472 


Strong  Associations  (>.5) 

Weak  Associations 

.3  to  .5) 

Small  Assocations  (.1  to  .3) 

89 

40 

23 

Median  Correlation 

Mean  Correlation 

0.552624 

0.51837 

□  small  □  weak  ^  strong  association 


Table  3-9.  Standard  Deviations  of  Aa*  Scores  of  Black  472  Samples  by  Site 


Site 

Standard  deviation 

Alpha 

0.075 

Beta 

0.047 

Chi 

0.057 

Delta 

0.063 

Epsilon 

0.139 

lota 

0.040 

Kappa 

0.042 

Lambda 

0.120 

Mu 

0.046 

Omega 

0.147 

Site 

Standard  deviation 

Omicron 

0.067 

Psi 

0.056 

Rho 

0.045 

Sigma 

0.093 

Tau 

0.048 

Theta 

0.075 

Upsilon 

0.066 

Xi 

0.051 

Zeta 

0.059 

Figure  3-14  show  the  correlation  matrix  for  the  Ab*  scores  reported  by  the  sites 
for  the  samples  of  the  Black  472  fabrics.  These  correlations  are  quite  high  with 
the  exception  of  the  correlations  between  the  Ab*  scores  from  site  Omega  and  the 
scores  reported  from  the  other  sites.  The  low  correlations  for  site  Omega  are 
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somewhat  puzzling.  There  is  no  indication  from  Table  3-7  of  anything  out  of  the 
ordinary  about  the  Ab*  scores  of  the  Black  472  fabric  samples  from  site  Omega. 
One  possibility  is  within-site  variation  of  the  Ab*  scores  in  site  Omega. 

Figure  3-14.  Correlation  Matrix  for  Ab*  Scores  for  Black  472 


Strong  Associations  (>.5) 

Weak  Associations 

.3  to  .5) 

Small  Assocations  (.1  to  .3) 

152 

13 

6 

Median  Correlation 

Mean  Correlation 

0.915242 

0.818749 

□  small  □  weak  ^  strong  association 


Table  3-10  shows  the  standard  deviations  of  the  Ab*  scores  of  the  Black  472 
samples  by  site.  The  standard  deviations  from  sites  Sigma  and  Omega  are  higher 
than  those  from  the  other  sites,  yet  the  correlations  of  scores  from  site  Sigma  and 
the  scores  from  the  other  sites  are  still  high.  We  have  no  other  explanation  for 
these  low  correlations  from  site  Omega  for  the  Ab*  scores. 


Table  3-10.  Standard  Deviations  of  Ab*  Scores  of  Black  472  Samples  by  Site 


Site 

Standard  deviation 

Alpha 

0.140 

Beta 

0.153 

Chi 

0.127 

Delta 

0.162 

Epsilon 

0.199 

lota 

0.137 

Kappa 

0.149 

Lambda 

0.152 

Mu 

0.145 

Omega 

0.214 

Site 

Standard  deviation 

Omicron 

0.153 

Psi 

0.149 

Rho 

0.150 

Sigma 

0.215 

Tau 

0.144 

Theta 

0.165 

Upsilon 

0.136 

Xi 

0.129 

Zeta 

0.149 
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Hypothesis  2 — Findings 

Overall,  we  find  that  spectrophotometer  machines  at  different  sites  tend  to  report 
AL*,  Aa*,  and  Ab*  readings  that  are  reasonably  close  to  one  another.  The  caveats 
to  this  overall  conclusion  are  as  follows: 

♦  AL*  and  Ab*  readings  of  Air  Force  Grey  from  sites  Omega  and  Omicron 
tended  to  diverge  significantly  from  the  AL*  and  Ab*  readings  recorded  at 
other  sites. 

♦  Readings  of  AL*,  Aa*,  and  Ab*  across  sites  were  more  highly  correlated 
for  the  Air  Force  Grey  Fabrics  than  they  were  for  the  Black  472  fabrics. 
We  found  the  weakest  correlation  among  the  Aa*  scores  of  the  Black  472 
fabrics.  This  lack  of  correlation  appeared  to  be  related  to  a  lack  of  varia¬ 
tion  in  the  Aa*  scores  of  the  Black  472  fabrics. 

Corollary  l.a 

Having  tested  the  hypotheses  1  and  2,  we  now  turn  to  the  corollaries.  The  first 
corollary  we  examine  is  Corollary  La: 

Corollary  l.a:  Spectrophotometers  produced  by  different  manufacturers 
will  produce  different  values  of  L*,  a*,  and  b*. 

Central  T endency  and  Dispersion  of  Absolute  Measures  (L*,  a*,  and  b*) 
of  Solid  Color  T iles  and  Fabrics  by  Manufacturer 

Table  3-11  gives  summary  results  by  device  manufacturer  for  the  six  color  tile 
samples  and  the  three  solid  color  fabric  samples  in  this  study  for  which  values  of 
L*,  a*,  and  b*  are  available. 

Table  3-11.  Color  Samples  by  Manufacturer  That  Varied  by  More  Than  0.3 
from  the  Mean  Readings  from  All  Manufacturers 


Spectrophotometer  brand 

L* 

a* 

b* 

Aleph 

Daleth 

Coyote  498 

Army  Green  491 

Army  Green  491 
Desert  Sand  503 

Deep  Blue  Tile 

Gimel 

Beth 

Deep  Blue  Tile 
Deep  Grey  Tile 
Coyote  498 

Desert  Sand  503 

White  Tile 

Deep  Blue  Tile 

Pale  Grey  Tile 

Mid  Grey  Tile 

Deep  Grey  Tile 
Coyote  498 
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Table  3-11  shows  the  number  of  color  samples  for  which  the  mean  measurements 
from  a  particular  spectrophotometer  manufacturer  were  more  than  0.3  above  or 
below  the  mean  values  reported  by  all  manufacturers.  We  use  the  value  of  0.3  ra¬ 
ther  than  0.5  because,  in  no  instance,  were  the  reported  mean  values  from  particu¬ 
lar  spectrophotometer  manufacturers  more  than  0.5  above  or  below  the  mean  for 
all  models  for  either  L*,  a*,  or  b*.  Highlights  of  this  table  are  as  follows: 

♦  Machines  manufactured  by  Gimel  and  Aleph  tend  to  have  mean  readings 
closest  to  one  another.6 

♦  The  Beth  machines  have  the  most  divergent  readings  for  L*  and  b*.  In 
particular,  the  Beth  machines  had  divergent  b*  readings  for  five  of  the  six 
tile  color  samples.  Further  manual  inspection  of  the  data  (not  shown)  re¬ 
vealed  that  these  divergent  readings  came  mostly  from  the  Beth  machine 
at  site  Tau.  Therefore,  it  may  be  site-specific  factors  at  site  Tau  rather  than 
the  Beth  machine  that  is  causing  the  divergent  readings. 

♦  The  Daleth  machine  had  two  out  of  nine  divergent  readings  for  L*  and  a*. 
In  both  cases,  these  were  readings  from  fabric  samples. 

We  will  now  look  at  how  the  standard  deviations  of  the  scores  for  L*,  a*,  and  b* 
vary  by  the  brand  of  spectrophotometer  (see  Figure  3-15,  Figure  3-16,  and 
Figure  3-17). 7 


6  Sixteen  of  the  19  spectrophotometers  in  this  study  were  either  Gimel  or  Aleph  models.  Since 
the  readings  from  these  two  models  tend  to  agree  with  one  another,  the  readings  from  these  two 
models  have  a  disproportionate  effect  on  the  overall  mean  for  all  manufacturers.  The  fact  that  the 
Daleth  machine  and  the  two  Beth  machines  produce  readings  that  diverge  from  this  mean  does  not 
necessarily  imply  anything  about  the  relative  accuracy  among  the  different  brands. 

7  There  was  only  one  Daleth  spectrophotometer.  This  reduces  the  potential  for  variation  on  the 
Dalet  brand  significantly  below  that  of  other  brands.  As  a  result,  no  inferences  should  be  drawn 
about  the  former  Daleth’s  brand’s  performance  relative  to  the  other  brands  based  upon  these  graphs. 
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Figure  3-15.  Standard  Deviations  of  Absolute  L*  by  Spectrophotometer  Maker 

and  Type  of  Sample 


— *—  Aleph  -■-Daleth  -±-Gimel  -*-Beth 


491 


Figure  3-16.  Standard  Deviations  for  Absolute  a*  by  Spectrophotometer  Maker 

and  Type  of  Sample 


-♦-Aleph  Daleth  -A-Gimel  -»-Beth 


491 
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Figure  3-17.  Standard  Deviations  of  Absolute  b*  by  Spectrophotometer  Maker 

and  Type  of  Sample 


— »- Aleph  -■-Daleth  -±- Gimel  -»-Beth 


Deep  Green  Pale  Grey  Mid  Grey  Deep  White  Tile  Army  Coyote  Desert 
Blue  Tile  Tile  Tile  Tile  Grey  Tile  Green  498  Sand  503 

491 


Highlights  from  Figure  3-15,  Figure  3-16,  and  Figure  3-17  include  the  following: 

♦  Sites  with  Aleph  spectrophotometers  showed  the  most  dispersion  in  L* 
and  b*  measurements.  The  high  level  of  dispersion  of  L*  and  b*  readings 
of  the  White  and  Pale  Grey  tiles  we  observed  in  Figure  3-1  appears  to  be 
caused  by  the  divergent  readings  from  the  Aleph  machines.  The  greater 
divergence  in  L*  measurements  from  Aleph  machines  was  limited  to  col¬ 
ors  with  the  highest  L*  scores  (White  and  Pale  Grey).  The  greater  diver¬ 
gence  in  b*  measurements  from  Aleph  machines  was  seen  for  seven  of  the 

o 

nine  colors.  An  analysis  of  variance  (not  shown)  comparing  the  disper¬ 
sion  of  measurements  within  Aleph  sites  to  the  dispersion  of  measurement 
across  Aleph  sites  indicates  that  nearly  all  of  the  variation  observed  for 
measurements  from  Aleph  machines  was  between  Aleph  sites.  In  other 
words,  readings  from  a  particular  Aleph  machine  showed  little  dispersion, 
while  readings  from  different  Aleph  machines  had  a  tendency  to  diverge 
from  one  another  for  L*  and  b*  measurements. 


8  This  finding  of  greater  dispersion  for  L*  and  b*  scores  from  Aleph  machines  may  seem  to 
contradict  our  earlier  finding  from  Table  3-11  that  average  readings  from  Aleph  and  Gimel  ma¬ 
chines  were  the  most  similar  to  average  readings  of  all  the  machines.  While  the  individual  Aleph 
machines  are  producing  divergent  readings,  when  these  divergent  readings  are  averaged,  the  re¬ 
sulting  average  tends  to  match  the  average  of  the  readings  from  the  Gimel  machines.  This  is  a 
good  example  of  the  importance  of  looking  at  measures  of  central  tendency  (means)  as  well  as 
dispersion  (standard  deviation).  Measures  of  central  tendency  give  us  a  sense  of  what  value  the 
scores  tend  to  cluster  around,  while  measures  of  dispersion  give  us  a  sense  of  how  tightly  or  loose¬ 
ly  those  values  cluster  around  that  mean  value. 
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♦  Sites  with  Gimel  spectrophotometers  generally  showed  a  similar  pattern  of 
dispersion  in  their  CIELAB  measurements  of  color  tiles  to  those  of  the 
Beth  machines.  On  the  other  hand,  sites  with  Gimel  machines  generally 
reported  more  dispersed  L*  and  b*  readings  of  color  fabrics  than  did  sites 
with  Beth  machines.  Further  inspection  of  the  data  (not  shown)  indicated 
that  the  unusually  high  standard  deviation  for  Gimel  L*  scores  of  Desert 
Sand  503  was  caused  primarily  by  divergent  readings  from  site  Omicron. 

♦  The  White  tile  and  the  three  shades  of  grey  tiles  all  showed  low  levels  of 
dispersion  in  a*  readings  from  all  sites.  These  four  color  shades  also  had 
a*  values  closest  to  zero  on  Table  3-3. 

Central  T endency  and  Dispersion  of  Absolute  Measures  (L*,  a*,  and  b*) 
of  Black  472  and  Air  Force  Grey  by  Manufacturer 

Table  3-12  shows  the  means  and  standard  deviations  for  the  samples  of  Black  472 
and  Air  Force  Grey  by  spectrophotometer  manufacturer.  None  of  the  means  in  the 
table  is  more  than  0.3  above  or  below  the  overall  means  for  Air  Force  Grey  or 
Black  472  shown  in  Table  3-6.  One  surprising  result  in  Table  3-12  is  that  the 
standard  deviations  for  CIELAB  scores  of  Black  472  and  Air  Force  Grey  from 
Gimel  machines  tend  to  be  higher  than  the  standard  deviations  for  Aleph  or  Beth 
machines.  Gimel  machines  for  the  most  part  showed  no  such  tendency  in  the  mea¬ 
surements  of  the  solid  color  samples. 

Table  3-12.  Means  and  Standard  Deviations  of  Air  Force  Grey  and  Black  472 

by  Manufacturer 


Black  472 

L* 

a* 

b* 

16.29 

0.35 

-1.49 

0.40 

0.13 

0.33 

16.38 

0.35 

-1.59 

0.61 

0.12 

0.71 

16.58 

0.46 

-1.32 

0.51 

0.06 

0.15 

Air  Force  Grey 

Scanner  maker 

L* 

a* 

b* 

Aleph  mean 

50.30 

0.17 

5.49 

Aleph  standard  deviation 

1.28 

0.39 

0.76 

Gimel  mean 

50.21 

0.11 

6.03 

Gimel  standard  deviation 

1.70 

0.49 

0.71 

Beth  mean 

50.34 

0.16 

5.81 

Beth  standard  deviation 

1.26 

0.42 

0.47 

Note:  Each  number  is  an  average  of  observations  taken  by  spectrophotometers  of  that  brand.  There 
were  144  observations  for  Aleph,  126  for  Gimel,  and  36  for  Beth.  There  are  no  L*,  a*,  or  b*  measurements 
for  the  site  with  the  Daleth  machine.  No  means  are  shaded  because  there  are  no  significant  divergences  in 
readings  between  the  different  models  of  machines. 
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Corollary  l.a — Findings 

We  find  limited  support9  for  Corollary  l.a.  Readings  from  spectrophotometers  pro¬ 
duced  by  different  manufacturers  do  diverge  from  one  another,  but  this  divergence 
is  never  over  0.5  on  any  of  the  CIELAB  measurements.  Perhaps  the  most  important 
difference  among  the  machines  is  the  dispersion  in  their  readings.  Aleph  machines 
at  different  sites  showed  a  tendency  to  produce  divergent  readings  of  the  solid  color 
samples.  Gimel  machines  showed  the  greatest  tendency  to  produce  divergent  read¬ 
ings  of  Air  Force  Grey  and  Black  472. 

Corollary  l.b 


Corollary  l.b:  Spectrophotometers  produced  by  different  manufacturers 
will  produce  different  values  of  AL*,  Aa*,  and  Ab*. 

Table  3-13  displays  means  and  standard  deviations  for  the  Air  Force  Grey  and  Black 
472  samples  by  spectrophotometer  manufacturer.  Our  earlier  findings  for  absolute 
CIELAB  scores  cannot  be  extended  to  the  difference  scores  of  AL*,  Aa*,  and  Ab*. 
The  only  significantly  divergent  readings  we  see  in  this  table  are  the  AL*  scores  for 
Air  Force  Grey  from  the  Daleth  machine,  which  was  the  only  mean  difference  score 
to  be  greater  than  1.0.  Overall,  we  find  limited  support  for  Corollary  l.b. 

Table  3-13.  AL*,  Aa*,  and  Ab*  by  Spectrophotometer  Brand  and  Type 

of  Material  Sampled 


Black  472 

AL* 

Aa* 

Ab* 

-0.531 

-0.015 

0.115 

0.397 

0.072 

0.158 

-0.618 

0.028 

0.019 

0.342 

0.120 

0.152 

-0.469 

0.020 

0.133 

0.607 

0.100 

0.174 

-0.673 

-0.009 

0.136 

0.383 

0.049 

0.140 

Air  Force  Grey 

Scanner  maker 

AL* 

Aa* 

Ab* 

Aleph  mean 

0.041 

0.279 

0.041 

Aleph  standard  deviation 

1.232 

0.387 

1.232 

Daleth  mean 

1.556 

0.130 

0.862 

Daleth  standard  deviation 

1.343 

0.408 

0.511 

Gimel  mean 

0.372 

0.180 

0.817 

Gimel  standard  deviation 

1.571 

0.542 

0.734 

Beth  mean 

-0.038 

0.274 

0.569 

Beth  standard  deviation 

1.262 

0.415 

0.474 

Note:  Each  number  is  an  average  of  observations  taken  by  spectrophotometers  of  that  brand.  There  were  161 
observations  for  Aleph,  18  for  Daleth,  126  for  Gimel,  and  36  for  Beth.  Means  are  shaded  in  yellow  if  absolute  value 
of  the  average  difference  value  was  greater  than  1 . 


9  When  we  test  a  hypothesis  or  corollary,  we  are  seeking  evidence  to  determine  whether  or  not 
this  hypothesis  is  true  or  not  true.  If  our  evidence  leads  us  to  believe  that  the  hypothesis  is  true,  we 
say  that  the  evidence  supports  the  hypothesis.  If  our  evidence  does  not  lead  us  to  believe  that  the 
hypothesis  is  true,  then  we  say  that  evidence  does  not  support  the  hypothesis. 
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Corollary  l.b — Findings 

Overall,  we  find  very  little  support  for  Corollary  l.b. 

Corollary  2. a 


Corollary  2. a:  Spectrophotometers  with  different  lens  aperture  sizes  will 
produce  readings  of  L*,  a*,  and  b*  that  correlate  differently. 

The  primary  purpose  of  our  data  collection  from  sample  readings  of  Air  Force 
Grey  and  Black  472  was  to  test  Hypothesis  2.  However,  this  data  collection  also 
provided  us  with  an  opportunity  to  test  Corollaries  2. a  and  2.b  because  these  sam¬ 
ples  were  measured  using  two  different  types  of  aperture  size.  The  Air  Force  Grey 
samples  were  measured  using  a  small  aperture  size  and  the  Black  472  samples  were 
measured  using  a  large  aperture  size.  This  was  not  a  perfect  test  of  Corollaries  2. a 
and  2.b  because  we  cannot  rule  out  the  possibility  that  different  levels  of  correlation 
among  these  sets  of  reasons  are  affected  by  differences  in  the  samples  themselves. 
We  have  already  found,  for  example,  that  the  standard  deviations  of  CIELAB 
scores  were  higher  for  Air  Force  Grey  samples  than  for  Black  472  samples. 

Figure  3-18,  Figure  3-19,  Figure  3-20,  Figure  3-21,  Figure  3-22,  and  Figure  3-23 
show  the  correlation  matrices  for  L*,  a*,  and  b*  CIELAB  scores  for  the  Air  Force 
Grey  and  Black  472  fabric  samples.  These  correlation  matrices  are  similar  to  the 
earlier  figures  that  showed  the  difference  CIELAB  scores  for  the  Air  Force  Grey 
and  Black  472  fabric  samples.  All  the  CIELAB  scores  for  Air  Force  Grey  are 
more  highly  correlated  than  the  CIELAB  scores  for  Black  472. 
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Figure  3-18.  Correlations  of  L*  Scores  for  Air  Force  Grey 
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Figure  3-19.  Correlations  of  a*  Scores  for  Air  Force  Grey 
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Figure  3-20.  Correlations  of  b*  Scores  for  Air  Force  Grey 
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Figure  3-21.  Correlations  of  L*  Scores  for  Black  472 
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Figure  3-22.  Correlations  of  a*  Scores  for  Black  472 
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Figure  3-23.  Correlations  of  b*  Scores  for  Black  472 
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Corollary  2. a — Findings 

We  find  some  support  for  Corollary  2. a.  Spectrophotometers  taking  measure¬ 
ments  with  a  small  aperture  size  do  report  more  highly  correlated  CIELAB  meas¬ 
ures  than  spectrophotometers  taking  measurements  with  a  large  aperture  sizes. 
We  must  qualify  our  support  for  this  corollary  by  noting  that  the  small  and  large 
aperture  sizes  were  used  on  two  entirely  different  fabric  sample  sets. 


3-31 


Corollary  2.b 


Corollary  2.b:  Spectrophotometers  with  different  lens  aperture  sizes  will 
produce  readings  of  AL*,  Aa*,  and  Ab*  that  correlate  differently. 

Our  test  of  corollary  2.b  is  the  same  as  our  test  of  Corollary  2. a.  The  correlation 
matrices  we  used  to  test  this  corollary  are  in  Figure  3-9,  Figure  3-10,  Figure  3-11, 
Figure  3-12,  Figure  3-13,  and  Figure  3-14.  These  figures  show  the  correlation 
matrices  for  AL*,  Aa*,  and  Ab*  readings  of  the  Air  Force  Grey  and  Black 
472  samples. 

Corollary  2.b — Findings 

The  scores  from  the  Air  Force  Grey  samples  were  consistently  more  highly  corre¬ 
lated  than  the  scores  for  the  Black  472  samples.  Recall  that  the  Air  Force  Grey 
samples  were  measured  with  small  lens  aperture,  while  the  Black  472  samples 
were  measured  with  a  large  lens  aperture.  As  a  result,  our  findings  for  corollary 
2.b  are  the  same  as  our  findings  for  corollary  2. a. 

Supplemental  Analysis — Does  Testing 
Change  the  L*,  a*,  and  b*  Values  of  Samples? 

During  our  study,  we  found  b*  readings  tended  to  decay  with  repeated  measure¬ 
ments  for  specific  samples. 10  We  examined  if  the  tendency  was  statistically  sig¬ 
nificant  and  if  it  occurred  on  all  CIELAB  axes.  We  examined  the  CIELAB 
readings  from  the  Desert  Sand  503  fabric  to  test  the  hypothesis  that  spectrophoto¬ 
meter  readings  decay  over  time. 

Figure  3-24  through  Figure  3-26  indicate  the  results  of  repeated  L*,  a*,  and  b*  mea¬ 
surements  of  the  Desert  Sand  503  fabric.  There  was  little  change  in  the  L*  and  a* 
measurements,  but  b*  measurements  to  appear  to  decay  after  the  first  measurement. 


10 


The  Natick  specialists  noticed  the  pattern  in  b*  readings  while  testing  Desert  Sand  503. 
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Figure  3-24.  Desert  Sand  L*  Readings 


|  ♦  Aleph  M  Daleth  — A— Gimel  •  Beth  | 


Figure  3-25.  Desert  Sand  a*  Readings 


^^~Aleph  ■  Daleth  — *—  Gimel  •  Beth 
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Figure  3-26.  Desert  Sand  b  *  Readings 


|  ♦  1  Aleph  M  Daleth  — A— Gimel  •  Beth~| 


Reading 


After  visually  inspecting  the  data,  our  next  step  was  to  test  if  there  was  an  over¬ 
all  downward  trend  in  L*,  a*,  or  b*  readings  of  the  Desert  Sand  503  fabric. 
Table  3-14  shows  the  correlation  coefficients11  we  obtained  between  the  number 
of  observations  and  the  L*,  a*,  and  b*  measurements.  All  of  the  correlation 
coefficients  are  tiny,  and  none  reaches  statistical  significance. 

Table  3-14.  Bivariate  Correlations 
between  Observations  and  CIELAB  Measures 


L* 

a* 

b* 

Observation  number 

-0.004 

-0.036 

-0.059 

Note:  Each  correlation  estimated  from  340  cases. 


While  there  is  no  evidence  of  an  overall  trend  toward  decreasing  L*,  a*,  or  b* 
measurements,  there  is  still  the  possibility  that  the  difference  between  the  first 
b*  reading  and  subsequent  b*  readings  could  be  statistically  significant.  As  we 
see  from  Figure  3-26,  this  decrease  appears  particularly  evident  in  readings  from 
Aleph  machines.  To  test  the  possibility  of  a  difference  between  first  and  subse¬ 
quent  readings,  we  conducted  t-tests  comparing  first  readings  and  subsequent 
readings.  We  conducted  separate  t-tests  for  machines  produced  by  different 

11  Correlation  coefficients  can  range  from  -1  to  1.  A  correlation  coefficient  of  0  indicates  that 
there  is  no  relationship  between  two  variables.  A  correlation  coefficient  of  1  or  -1  means  the  value 
of  one  variable  can  be  perfectly  predicted  from  the  value  of  another  variable.  An  example  of  two 
variables  with  a  correlation  of  1  would  be  the  same  temperature  recorded  in  Fahrenheit  and  Celsius. 
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manufacturers.  The  results  are  shown  on  Table  3-15.  These  results  indicate  that 
the  Aleph,  Daleth,  and  Beth  machines  all  had  statistically  significant  differences 
between  their  first  b*  readings  and  their  subsequent  b*  readings. 

Table  3-15.  Mean  First  and  Subsequent  b*  Readings  by  Manufacturer 


Manufacturer 

Mean  first  b* 
reading 

Mean  subsequent 
b*  readings 

Mean  decrease 

Aleph 

10.640 

10.329 

0.311 

Gimel 

10.491 

10.395 

0.097 

Daleth 

10.600 

10.373 

0.227 

Beth 

10.515 

10.334 

0.181 

Note:  A  total  of  18  readings  were  taken  from  each  machine.  The  first  readings  from 
two  of  the  Aleph  machines  were  missing  from  the  data  set.  Values  for  mean  decrease 
shaded  in  yellow  were  statistically  significant. 


Summary  of  Findings  in  Phase  2 


Overall,  we  find  support  for  Hypothesis  1  and  Hypothesis  2.  Spectrophotometer 
readings  tend  to  be  consistent  across  sites,  equipment  manufacturers,  and  spectro¬ 
photometer  aperture  sizes.  Other  findings  from  Phase  2  include  the  following: 

♦  Site  Rho  consistently  reported  readings  for  L*  and  b*  for  all  color  samples 
that  diverged  significant  from  the  scores  reported  from  all  other  sites;  how¬ 
ever,  the  AL*,  Aa*,  and  Ab*  readings  from  site  Rho  did  not  significantly 
differ  from  the  AL*,  Aa*,  and  Ab*  readings  from  the  other  sites. 

♦  Sites  Omega  and  Omicron  reported  AL*  and  Ab*  readings  that  were  sig¬ 
nificantly  different  from  the  AL*  and  Ab*  readings  reported  by  other  sites. 
The  greater  dispersion  of  readings  from  site  Omicron  was  mostly  due  to  a 
single  aberrant  reading. 

♦  The  brand  of  the  machine  had  a  modest  effect  on  machine  readings.  Dif¬ 
ferent  brands  produced  readings  with  different  dispersion  profiles.  None  of 
the  brands  was  superior  overall  to  the  other  brands  in  terms  of  the  consis¬ 
tency  of  their  readings. 

♦  L*,  a*,  b*,  AL*,  Aa*,  and  Ab*  readings  generated  by  a  machine  with  a 
small  aperture  lens  on  camouflage  fabrics  are  correlated  at  least  as  strongly 
as  L*,  a*,  b*,  AL*,  Aa*,  and  Ab  readings  generated  by  a  machine  with  a 
large  aperture  lens  on  camouflage  fabrics. 

♦  The  first  b*  readings  of  the  color  Desert  Sand  tended  to  be  significantly 
higher  than  subsequent  b*  readings.  This  effect  was  largest  for  the  Aleph 
machines  and  smallest  for  the  Gimel  machines. 
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Phase  3 


In  Phase  2  evidence  was  presented  that  spectrophotometer  measurements  tend  to 
correlate  highly  with  one  another.  In  Phase  3,  comparison  was  made  of  evalua¬ 
tions  of  military  fabrics  based  on  spectrophotometer  measurements  with  evalua¬ 
tions  made  by  human  shade  evaluators.  The  objective  was  to  test  the  following 
hypothesis: 

Hypothesis  3:  There  is  a  correlation  between  spectrophotometer  pass/fail 
evaluation  based  on  instrumental  measurement  and  human  shade  evaluator 
pass/fail  evaluation  based  on  visual  methods.1 

Methodology 


Ten  human  shade  evaluators  from  the  Anny  Natick  Soldier  RDEC  and  the 
DSCC  PTC  were  recruited  for  the  study.  Each  evaluator  was  qualified  using  the 
Farnsworth  Munsell  100  Hue  Test.  The  intent  was  to  recruit  evaluators  with  nor¬ 
mal  color  vision  who  were  not  color  shade  experts.  The  evaluators  reviewed 
10  sets  of  10  colored  fabric  samples.  Since  each  evaluator  made  100  evaluations, 
this  provided  a  data  set  of  1,000  evaluations. 

In  each  assessment,  the  evaluator  was  asked  to  make  a  detennination  as  to  whether 
a  given  fabric  passed  or  failed  based  on  a  given  shade  standard  and  set  of  shade 
tolerances. 

Anny  Natick  Soldier  RDEC  and  DSCC  PTC  provided  an  “official”  detennination 


for  each  fabric  sample  as  to  whether  the  sample  met  the  given  shade  standard  and 
was  acceptable  within  tolerance.  The  fabric  samples  included  the  following: 

♦  Black  385  (Anny) 

♦ 

Anny  Green  415  (Army) 

♦  Blue  3329  (Navy) 

♦ 

Green  489  (Army) 

♦  Blue  3346  (Navy) 

♦ 

Sage  Green  1590  (Air  Force) 

♦  Blue  3376  (Navy) 

♦ 

MC  2312  (Marines) 

♦  Blue  3386  (Navy) 

♦ 

Pewter  2246  (Marines). 

1  Use  of  the  industry  standard  criteria  of  dE  CMC  (a  modification  of  the  CIELAB  equation  to 
represent  shade  variation  as  one  value)  2:1  of  1.0  is  implied  by  the  hypothesis. 
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Percentage  Agreement — Humans 
with  “Official”  Determination 


Table  4-1  shows  what  percentage  of  the  time  the  pass/fail  decisions  given  by  each 
of  the  evaluators  agreed  with  the  “official”  pass/fail  decision.  We  obtained  these 
percentages  by  adding  the  number  of  times  a  human  evaluator  and  the  official  de¬ 
termination  both  gave  a  “pass”  to  a  given  fabric  sample  to  the  number  of  times 
that  a  human  evaluator  and  the  “official”  determination  both  gave  a  “fail”  to  a 
given  fabric  sample.  We  then  divided  that  result  by  the  total  number  of  pass/fail 
decisions  (in  this  case  100).  The  results  in  Table  4-1  indicate  a  range  of  agree¬ 
ment,  from  a  low  of  65  percent  to  a  high  of  77  percent. 

Table  4-1.  Agreement  on  Pass/Fail  Decision  of  Human  Shade 
Evaluators  with  the  “Official”  Determination  of  Pass/Fail 


Agreement 

percentage 

Phi  measure 

Evaluator  1 

70% 

0.41 0a 

Evaluator  2 

76% 

0.519a 

Evaluator  3 

69% 

0.373a 

Evaluator  4 

70% 

0.369a 

Evaluator  5 

73% 

0.431a 

Evaluator  6 

65% 

o 

ho 

00 

o, 

Evaluator  7 

77% 

0.518a 

Evaluator  8 

77% 

0.526a 

Evaluator  9 

71% 

0.386a 

Evaluator  10 

75% 

0.476a 

Median  among  evaluators 

72% 

0.421 

Mean  among  evaluators 

72.3% 

0.429 

a  Indicates  relationship  is  statistically  significant. 


These  results  suggest  a  moderate  level  of  agreement  between  the  human  evaluators 
and  the  “official”  detennination,  but  they  are  not  definitive.  This  is  because  the  per¬ 
centage  of  agreement  is  not  completely  reliable  as  a  measure  of  the  strength  of  the 
relationship  between  two  dichotomous  variables  (in  this  case,  the  two  sets  of 
pass/fail  decisions).  The  reason  is  the  agreement  percentage  does  not  give  us  any 
information  about  why  two  parties  agree.  For  example,  suppose  we  were  looking  at 
the  strength  of  the  relationship  of  pass/fail  decisions  between  two  human  evalua¬ 
tors.  Suppose  that  one  evaluator  passed  fabrics  75  percent  of  the  time  after  carefully 
scrutinizing  the  fabrics  while  another  evaluator  passed  all  of  the  fabrics  after  only  a 
cursory  glance.  The  resulting  agreement  percentage  would  be  75  percent,  even 
though  there  was  no  relationship  between  the  standards  that  the  two  evaluators  were 
using  to  judge  the  fabrics.  In  fact,  if  both  evaluators  merely  flip  a  coin  to  decide 
whether  a  fabric  passes,  they  should  still  agree  about  50  percent  of  the  time. 
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Relationship  among  Human  Evaluator 
Pass/Fail  Decisions  Using  the  Phi  Coefficient 


To  confirm  our  findings  about  agreement  percentage  among  the  human  shade 
evaluators,  we  turn  to  the  phi  coefficient.  The  phi  coefficient  can  vary  between  -1 
and  1  in  the  same  fashion  as  the  more  widely  known  r  coefficient.  If  the  phi  coef¬ 
ficient  between  two  sets  of  pass/fail  decisions  was  1,  the  pass/fail  decisions 
agreed  with  one  another  every  single  time.  If  the  phi  coefficient  between  two  sets 
of  pass/fail  decisions  was  -1,  the  pass/fail  decisions  disagreed  with  one  another 
every  single  time.  If  the  phi  coefficient  between  the  two  sets  of  pass/fail  decisions 
was  zero,  this  would  mean  knowing  one  pass/fail  decision  would  give  you  no  in¬ 
dication  whatsoever  about  what  the  other  pass/fail  decision  was  likely  to  be.  For 
example,  if  both  sets  of  pass/fail  decisions  used  a  coin  flip  to  detennine  whether 
color  samples  met  the  standard,  then  we  would  expect  the  phi  coefficient  to  be 
zero. 

2 

The  standard  that  we  used  to  judge  the  phi  coefficients  is  as  follows: 

♦  -1 .0  to  -0.5,  strong  negative  association 

♦  -0.49  to  -0.3,  moderate  negative  association 

♦  -0.29  to  -0.1,  small  negative  association 

♦  -0.09  to  0.09,  no  association 

♦  0.1  to  0.29,  small  positive  association 

♦  0.3  to  0.49,  moderate  positive  association 

♦  0.5  to  1.0,  strong  positive  association. 

The  phi  coefficients  expressing  the  relationship  between  the  pass/fail  decisions  of 
the  human  participants  and  the  pass/fail  decisions  generated  by  the  spectrophoto¬ 
meter  are  shown  in  the  third  column  of  Table  4-1.  All  of  the  coefficients  are  sta¬ 
tistically  significant  and  positive.  Small  associations  are  highlighted  in  blue, 
moderate  associations  are  highlighted  in  yellow,  and  strong  associations  are  hig¬ 
hlighted  in  red.  All  but  one  of  the  associations  is  moderate  or  strong.  The  phi  cor¬ 
relations  track  very  closely  with  the  pass/fail  agreement  percentages,  thus 
confirming  that  the  agreement  percentages  that  show  the  relationship  between 
human  pass/fail  decisions  and  spectrophotometer  pass/fail  decisions  are  valid. 


2  Jessica  Steele,  Choosing  the  Correct  Statistical  Test,  www.radford.edii/~jcsteele/Choosing 
the  Correct  Statistical  Test,  2006. 
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Overall,  we  find  that  the  mean  agreement  between  the  human  pass/fail  decisions 
and  the  spectrophotometer  pass/fail  decisions  is  72.3  percent.  The  median  percen¬ 
tage  of  agreement  is  72  percent.  The  degree  of  closeness  between  the  median  and 
the  mean  percent  agreement  tells  us  that  the  agreement  percentages  are  evenly 
clustered  around  72  percent.  A  visual  inspection  of  the  data  confirms  this.  Five  of 
the  agreement  percentages  are  below  72  percent  and  live  of  the  agreement  percen¬ 
tages  are  above  72  percent.  Similarly,  the  mean  and  median  phi  statistics  0.429 
and  0.421  are  also  close  to  one  another,  and  the  phi  statistics  are  evenly  divided 
above  and  below  the  mean  and  median  phi  statistics. 

Comparing  the  Instrumentation 
and  the  “Official”  Determination 


After  comparing  the  individual  human  evaluator  pass/fail  decisions  to  the  “offi¬ 
cial”  pass/fail  decisions,  our  next  step  was  to  see  how  pass/fail  decisions  gener¬ 
ated  by  a  spectrophotometer  compared  to  the  official  pass/fail  decisions.  To 
generate  the  spectrophotometer  pass/fail  decision,  we  used  a  gauge  based  on 
dE  CMC  tolerance.  If  the  dE  CMC  score  of  the  sample  fabric  differed  by  more 
than  1.0  from  the  dE  CMC  score  of  the  fabric  designated  as  the  standard,  then 
the  fabric  sample  was  given  a  “fail”  score. 

Table  4-2  shows  the  comparisons  of  the  agreement  percentage  scores  and  the  phi 
coefficients  between  pass/fail  decisions  of  the  human  evaluators,  the  spectropho¬ 
tometer,  and  the  official  pass/fail  determination.  The  agreement  percentage  and 
phi  coefficient  for  the  spectrophotometer  are  statistically  indistinguishable  from 
the  mean  and  median  agreement  percentage  and  phi  coefficient  for  the  human 
evaluators.  In  other  words,  the  spectrophotometer  performed  as  well  as  the  aver¬ 
age  human  evaluator  in  terms  of  how  well  its  pass/fail  decision  compared  to  the 
“official”  pass/fail  determination. 

Table  4-2.  Percentage  of  Agreement  between  Human  Evaluators, 
Instrumentation,  and  the  “Official’’  Determination 


Agreement 

percentage 

Phi 

dE  CMC  with  “official”  determination 

71% 

0.425a 

Median  human  agreement  with  “official”  determination 

72% 

0.421 

Mean  human  agreement  with  “official”  determination 

72.3% 

0.429 

a  Indicates  relationship  is  statistically  significant. 
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Phase  3 


Is  There  a  Pattern  between  “Official" 
Determination  and  Instrumentation- 
Generated  Pass/Fail  Decisions? 


Table  4-3  is  a  crosstab  between  the  official  and  spectrophotometer-generated 
pass/fail  decisions.  The  “official”  determination  was  much  more  likely  than  the 
spectrophotometer — 41  percent  versus  14  percent — to  fail  fabric  samples.  In 
fact,  there  was  only  one  instance  when  the  spectrophotometer  passed  a  fabric 
that  was  failed  by  the  official  determination.  On  the  other  hand,  there  were 
28  instances  when  the  official  determination  failed  a  fabric  that  was  passed  by 
the  spectrophotometer. 


Table  4-3.  “Official”  Determination  and  Spectrophotometer  Pass/Fail  Decisions 


Official  decision 

Spectrophotometer  decision 

Total 

Fail 

Pass 

Fail 

13 

28 

41 

Pass 

1 

58 

59 

Total 

14 

86 

100 

When  the  principal  military  shade  evaluators  at  Anny  Natick  Soldier  RDEC  and 
the  DSCC  PTC  made  the  official  pass/fail  decisions,  they  also  provided  a  reason 
for  why  the  fabric  sample  was  not  acceptable.  This  gives  us  an  opportunity  to  test 
if  there  is  a  relationship  between  the  reason  for  the  fail  decision  and  the  spectro¬ 
photometer’s  pass/fail  decision.  For  example,  perhaps  fabric  samples  official  eva¬ 
luators  judged  “too  full”  were  more  likely  to  be  passed  by  the  spectrophotometer 
than  were  fabric  samples  judged  “too  red.” 

Table  4-4  shows  the  reason  for  the  official  fail  decision  and  the  instrumentation 
pass/fail  decision.  The  chi-square  for  this  crosstab  is  0.407,  indicating  that  we 
have  no  verification  to  suppose  that  there  is  any  evidence  for  a  relationship.  In 
order  to  conclude  that  we  have  evidence  of  a  relationship,  the  chi-square  value 
would  need  to  be  below  0.05.  As  a  result,  we  have  no  basis  for  concluding  that 
instrumentation  will  agree  with  officials  for  some  types  of  fail  decision  more  of¬ 
ten  than  for  other  types. 
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Table  4-4.  Reasons  Given  by  Officials  for  Rejecting  Fabric  Samples 
by  Instrumentation  Pass/Fail  Decision 


Official  reason 
for  rejection 

Spectrophotometer  decision 

Total 

Pass 

Fail 

Too  full 

7 

3 

10 

Too  thin 

1 

0 

1 

Too  yellow 

3 

3 

6 

Too  blue 

3 

0 

3 

Too  red 

0 

1 

1 

Too  green 

3 

0 

3 

Thin  blue 

0 

1 

1 

Thin  red 

2 

2 

4 

Thin  green 

1 

0 

1 

Full  yellow 

2 

2 

4 

Full  blue 

2 

0 

2 

Full  red 

4 

1 

5 

Total 

28 

13 

41 

Summary  of  Findings  in  Phase  3 


We  find  there  is  correlation  between  spectrophotometer  pass/fail  decisions  based  on 
instrumental  measurement  and  human  evaluator  pass/fail  decisions  concerning 
whether  solid  fabrics  meet  acceptable  shade  standards. 

♦  When  individual  human  shade  evaluators  made  pass/fail  decisions  con¬ 
cerning  whether  solid  color  fabrics  met  shade  standards,  they  agreed  with 
the  official  detennination  between  65  percent  and  77  percent  of  the  time. 

♦  The  pass/fail  detennination  generated  by  spectrophotometers  agreed  with 
the  official  determination  71  percent  of  the  time.  This  rate  is  comparable 
to  the  success  rates  for  the  human  evaluators.  (We  validated  the  above  per¬ 
centages  with  the  appropriate  statistical  procedures.) 

♦  The  official  pass/fail  visual  determination  was  more  likely  than  the  spec¬ 
trophotometer  to  determine  which  samples  did  not  meet  pass  criteria. 

♦  There  is  no  evidence  of  a  relationship  between  reasons  given  by  officials 
for  detennining  that  a  sample  did  not  meet  standards  and  whether  or  not 
the  spectrophotometer  generated  a  failing  score  for  that  sample. 
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Appendix  A 

Ancillary  Study  Report 


This  report  describes  the  purpose,  process,  and  findings  of  the  ancillary  study  por¬ 
tion  of  the  Customer  Drive  Uniform  Manufacture  (CDUM)  program.  The  purpose 
of  the  ancillary  study  was  to  provide  insight  on  the  variability  of  shade  evaluation 
labs  used  in  both  industry  and  government  facilities.  The  ancillary  study  was  con¬ 
ducted  to  detennine  if  factors  other  than  instrumentation  were  causing  variability 
in  readings  of  fabric  shade  parameters.  Questions  related  to  ancillary  factors,  in¬ 
cluding  illumination,  environment,  sample  submittal,  and  spectrophotometer  use, 
were  used  to  collect  data  from  each  lab.  The  data  was  then  compiled  into  table 
format  for  comparison.  The  tables  illustrated  the  level  of  variation  for  each  ancil¬ 
lary  factor. 

Some  industry  and  government  facilities  have  multiple  labs  on  site.  The  labs  ob¬ 
served  in  this  study  are  used  for  the  purpose  of  evaluating  fabric  or  material  for 
lot  acceptance  or  rejection  and  adherence  to  shade  specifications. 

Process 


The  ancillary  study  was  conducted  by  observing  laboratory  conditions,  measuring 
lab  dimensions,  and  interviewing  lab  and  production  personnel.  Representatives 
from  LMI  and  the  U.S.  Army  Soldier  Research,  Development  and  Engineering 
Center  at  Natick,  MA,  conducted  site  visits  between  September  2008  and 
April  2009  to  14  labs.  The  labs  visited  were  as  follows: 

♦  Duro 

♦  Mount  Vernon 

♦  Crystal  Springs 

♦  Defense  Supply  Center,  Philadelphia 

♦  Gore 

♦  U.S.  Navy 

♦  Brittany 

♦  Kenyon 

♦  Polartec 
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♦  Bondcote 

♦  Burlington  Finishing  Plant 

♦  Burlington  Raeford 

♦  Milliken 

♦  Carlisle. 

During  the  site  visits,  LMI  and  Anny  Natick  Soldier  RDEC  representatives 
observed  the  conditions  of  the  labs  used  for  shade  evaluation,  and  production  fa¬ 
cilities  were  toured  where  applicable.  Each  site  visit  started  with  a  meeting  be¬ 
tween  the  LMI  representative,  Anny  Natick  Soldier  RDEC  representative,  and  the 
site’s  host  (usually  a  member  of  management).  After  introductions,  the  purpose  of 
the  visit  was  discussed.  The  host  outlined  the  agenda  for  the  visit,  and  then  the  lab 
and  production  areas  were  toured. 

The  tours  usually  started  in  the  shade  evaluation  lab.  The  LMI  representative 
asked  the  lab  employees  several  questions  related  to  the  ancillary  factors.  The 
questions  and  factors  are  listed  in  the  questionnaire  shown  as  Figure  A-l.  The 
questionnaire  was  used  during  all  site  visits. 

The  lab  employees  answered  the  questions,  and  the  LMI  representative  docu¬ 
mented  their  answers.  When  the  lab  employees  could  not  answer  a  question,  a  co¬ 
worker  or  member  of  management  was  called  in  to  assist.  Employees  of  the  pro¬ 
duction  area  were  sometimes  called  in  to  assist  with  answering  questions  related 
to  sample  submittal.  In  some  cases,  answers  to  questions  were  not  known  by  ei¬ 
ther  the  individuals  being  interviewed  or  any  of  their  peers.  In  these  cases,  the  an¬ 
swers  were  left  blank  on  the  questionnaire  and  tables. 

The  condition  of  the  lab  was  assessed  by  the  LMI  representative  without  input 
from  any  of  the  other  parties. 

After  gathering  the  necessary  data,  the  LMI  representative,  the  Anny  Natick  Sol¬ 
dier  RDEC  representative,  and  site  host  toured  the  production  area. 
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Ancillary  Study  Report 


Figure  A-1.  Questions  and  Factors  Asked  to  Lab  Personnel 


Ancillary  Questions 
Illumination  Conditions 

1)  What  illuminant  is  being  used?  (i.e. ,  Daylight  75) 

2)  What  is  the  brand  and  make  of  the  illuminant? 

3)  How  long  since  illuminant  was  changed? 

4)  When  was  the  illuminant  simulator  last  tested  or  calibrated?  How  often? 


Viewing  Environment 

5)  Is  a  shade  table  or  lightbox  being  used? 

6)  What  are  the  dimensions  of  the  room?  (length,  width,  height) 

7)  What  are  the  colors  of  the  walls  and  flooring? 

8)  If  gray,  is  it  equivalent  to  Munsell  N5  thru  N8? 

9)  Is  the  environment  climate  controlled?  (i.e.,  central  air,  localized  air,  climate  control) 

10)  Is  option  A,  B,  C,  or  D  from  AATCC  evaluation  procedure  9  being  used? 

1 1 )  What  is  the  angle  of  the  viewing  table? 

12)  What  is  the  distance  of  the  table  to  the  illuminant  source? 

13)  What  is  the  condition  of  the  lab?  (i.e.,  overall  cleanliness) 


Sample  Submittal 

14)  How  are  lot  samples  generated  for  selection? 

15)  How  are  they  stored/handled  until  selection? 

16)  What  is  the  time  period  from  sample  generation  to  selection? 

17)  How  are  selected  lot  samples  handled  after  selection? 

18)  How  long  between  lot  sample  selection  and  shipment  to  evaluation  lab? 

19)  How  are  selected  lot  samples  sent  to  evaluation  lab? 


Other 

20)  How  are  spectrophotometers  used  in  the  manufacturing  process? 


Findings 


The  three  primary  factors  observed  for  the  study  were  illumination,  view  envi¬ 
ronment,  and  sample  submittal.  Descriptions  of  each  factor  and  the  data  collected 
are  contained  below.  Infonnation  on  spectrophotometer  brand  and  software  is  also 
provided. 

Discrete  values  are  displayed  for  some  factors;  a  range  is  shown  for  others.  The 
distinction  between  discrete  value  and  range  was  based  upon  the  most  effective 
way  to  display  the  data. 
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Illumination 


Illumination  is  the  light  source  used  to  illuminate  the  fabric  while  it  is  evaluated 
for  adherence  to  specifications  by  the  human  eye.  The  brand,  change,  and  calibra¬ 
tion  schedules  were  observed  for  the  illuminants. 

Figure  A-2  displays  the  observations  for  illumination.  Note  that  the  numbers  in  pa¬ 
rentheses  following  the  descriptors  indicate  the  number  of  labs  for  which  that  de¬ 
scriptor  applies.  For  example,  eight  labs  use  the  Spectralight  III  brand  of  illuminant. 

Figure  A-2.  Observations  of  Illumination 


Illumination 

Brand 

Chanqe  Schedule 

Gretag  MacBeth  (1) 

As  needed  (7) 

Spectralight  (1) 

Scheduled  preventive 

Spectralight  II  (1) 
Spectralight  III  (8) 

maintenance  (6) 

EX  (1) 

Calibration 

Annually  (7) 

Sylvania 

Daylight  Delux  (1) 

Multiple  per  year  (4) 

View  Environment 

View  environment  describes  the  type  of  device  being  used  to  evaluate  fabric  (a 
shade  table  or  lightbox),  the  square  footage  of  the  lab,  the  colors  of  the  lab  walls 
and  flooring,  the  type  of  climate  control  the  lab  utilizes,  which  American  Associa¬ 
tion  of  Textile  Chemists  and  Colorists,  or  AATCC,  option  the  lab  follows  when 
evaluating  fabric,  the  angle  of  the  table  on  which  the  fabric  rests  for  evaluation, 
lab  cleanliness,  and  the  distance  from  the  table’s  surface  to  the  illuminant. 

Figure  A-3  displays  the  different  options  available  for  shade  evaluation  under 
AATCC  Evaluation  Procedure  9.  Figure  A-4  displays  the  observations  for  view 
environment. 
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Ancillary  Study  Report 


Figure  A-3.  Options  for  Shade  Evaluation  under  AATCC 
Evaluation  Procedure  9 


Figure  A-4.  Observations  for  View  Environment 


View  Environment* 

Device 

Climate  Control 

Shade  table  (14) 

Central  A/C  (10) 

Local  AC  (2) 

Square  footaqe  (approx.) 

57  to  1056 

AATCC  procedure  9 

Option  A  (10) 

Wail/fioorinq 

Option  B  (1) 

Gray  (8) 

Option  D  (2) 

Gray/white  (1) 

Procedure  similar  to  AATCC  (1) 

Tan/green/black  (1) 

Tan/gray  (1) 

Table  anqle  (approx.) 

Gray/red  (1) 

0  to  45  degrees 

Gray/brown  (2) 

Distance  to  illuminant  (approx.) 

29.5"  to  49" 

*We  observed  cleanliness  ranging  from  very  clean,  average  to  cluttered 

Sample  Submittal 


Sample  submittal  describes  the  type  of  packaging  used  to  hold  the  fabric  samples 
and  the  time  between  when  a  sample  is  generated  and  when  it  is  selected  for  re¬ 
view  by  the  govermnent  representative.  Figure  A- 5  displays  the  observations  for 
sample  submittal. 
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Figure  A-5.  Observations  for  Sample  Submittal 


Sample  Submittal 

Packaging 

Cardboard  box  (3) 

Plastic  bag  (5) 

Envelope  (3) 

Rubber  band  (1) 

Sample  generation  to  selection 

1  day  to  1  week 


Spectrophotometer 


A  spectrophotometer  is  an  instrument  used  by  lab  personnel  to  evaluate  fabric 
shade  against  a  given  standard.  Some  labs  use  spectrophotometers  in  conjunction 
with  human  evaluation  to  judge  whether  fabric  meets  shade  specifications.  The 
spectrophotometer  brands  and  associated  software  documented  observed  during 
the  ancillary  study  are  listed  in  Figure  A-6. 

Figure  A-6.  Spectrophotometer  Brands 


Spectrophotometer 

Brand 

Software 

Datacolor 

Datacolor  tools  1 .1 .0  (1 ) 

Colortools  QC  3.95  (1) 

Colortools  QC  (1) 

Datacolor  tools  1 .0.2  (1 ) 

Datacolor  tools  1.3  (1) 

Colortools  QC  3.1  (2) 

Chroma  QC  (1) 

Colortools  QC  3.0  (1) 

Hunterlab 

Easy  Match  QC  3.72.00  (2) 

Easy  Match  QC  3.84.00(1) 

Color  icontrol  4.11.002  (1) 

Easy  Match  QC  4.03.00(1) 

Universal  4.01  (1) 

Easy  Match  QC  3.90.00  (1 ) 

X  Rite 

X  Rite  Colormaster  8.0.3  (1) 

Color  iQC  v.6.0  (1) 

Gretag  MacBeth 

Gretag  Quality  Control  System  Version  329  (1) 
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Appendix  B 

Data  Collection  Protocol 

Data  Collection  Protocol  for  Ceramic  Tiles 

♦  The  ceramic  tiles  were  clean  and  free  of  lint  and  fingerprints. 

♦  Using  the  green  tile,  the  spectrophotometer  was  calibrated  to  the  manufac¬ 
turer’s  specifications  before  measurements  were  taken. 

♦  The  following  parameters  were  used  for  measuring  the  ceramic  tiles: 

>  Illuminant:  D65 

>  Area  view:  Large 

>  Observer:  10° 

>  Mode:  Specular  included — Reflectance 

>  Ultraviolet  (UV)  filter:  Set  to  OUT 

>  UV  lamp:  Set  to  OFF 

>  Scale:  CIELAB  Absolute  values. 

♦  The  tiles  were  as  follows: 

>  Pale  Grey  >  Mid  Grey 

>  Deep  Grey  >  Green 

>  White  >  Blue. 

♦  Each  tile  was  measured  in  the  same  area,  with  the  same  orientation,  to  col¬ 
lect  the  L*,  a*,  and  b*  values. 

♦  The  absolute  L*,  a*,  and  b*  values  were  recorded  for  each  tile  sample. 
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Data  Collection  Protocol  for  Different 
Shade  Fabric  Samples 

We  used  AATCC  Evaluation  Procedure  6,  Instrumental  Color  Measurement, 
Section  2,  “Measurement  of  Color  by  Reflectance  Methods,”  as  a  guideline  for 
the  different  shade  fabric  samples. 

♦  The  samples  were  clean  and  free  of  lint,  and  were  allowed  to  condition 
if  necessary. 

♦  Using  the  green  tile,  the  spectrophotometer  was  properly  calibrated  to  the 
manufacturer’s  specifications  before  measurements  were  taken. 

♦  The  following  parameters  were  used  for  measuring  the  fabric  samples: 

>  Illuminant:  D65 

>  Area  view:  Large 

>  Observer:  10° 

>  Mode:  Specular  Included — Reflectance 

>  UV  filter:  Set  to  OUT 

>  UV  lamp:  Set  to  OFF 

>  Scale:  CIELAB  Absolute  values. 

♦  The  samples  were  single-layer  samples  with  a  black  tile  as  the  backing. 

♦  The  samples  were  as  follows: 

>  Coyote  498  >  Desert  Sand  503  >  Army  Green  49 1 . 

♦  Each  sample  was  measured  in  the  same  area,  with  the  same  orientation,  to 
collect  the  L*,  a*,  and  b*  values. 

♦  The  absolute  L*,  a*,  and  b*  values  were  recorded  for  each  fabric  sample. 
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Data  Collection  Protocol 


Data  Collection  Protocol  for  Solid  Shade 
on  Different  Fabric  Samples 

We  used  AATCC  Evaluation  Procedure  6,  Instrumental  Color  Measurement, 
Section  2,  “Measurement  of  Color  by  Reflectance  Methods”  as  a  guideline  for 
the  same  shade  on  different  fabrics. 

♦  The  samples  were  clean  and  free  of  lint,  and  were  allowed  to  condition 
if  necessary. 

♦  Using  the  green  tile,  the  spectrophotometer  was  properly  calibrated  to  the 
manufacturer’s  specifications  before  measurements  were  taken. 

♦  The  following  parameters  were  used  for  measuring  the  fabric  pairs: 

>  Illuminant:  D65 

>  Area  view:  Large 

>  Observer:  10° 

>  Mode:  Specular  included — Reflectance 

>  UV  filter:  Set  to  OUT 

>  UV  lamp:  Set  to  OFF 

>  Scale:  CIELAB  Absolute  values. 

♦  The  samples  were  single-layer  samples  with  a  black  tile  as  the  backing. 

♦  The  samples  were  as  follows: 

>  Blue  450  p/w  serge  >  Blue  450  poly 

>  Blue  450  p/w  elastique  >  Blue  450  p/w  tropical. 

♦  Each  sample  was  measured  in  the  same  area,  with  the  same  orientation,  to 
collect  the  L*,  a*,  and  b*  values. 

♦  The  absolute  L*,  a*,  and  b*  values  for  each  fabric  sample  were  recorded. 
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Data  Collection  Protocol  for  Fabric  Pairs 

We  used  AATCC  Evaluation  Procedure  6,  Instrumental  Color  Measurement, 

Section  2,  “Measurement  of  Color  by  Reflectance  Methods,”  and  the  AATCC 
Test  Method  173-2005  CMC,  Calculation  of  Small  Color  Differences  for  Accep¬ 
tability,  as  guidelines  for  the  color  measurement  of  fabric  pairs. 

♦  The  samples  were  clean  and  free  of  lint,  and  were  allowed  to  condition 
if  necessary. 

♦  Using  the  green  tile,  the  spectrophotometer  was  properly  calibrated  to  the 
manufacturer’s  specifications  before  measurements  were  taken. 

♦  The  following  parameters  were  used  for  measuring  the  fabric  pairs: 

>  Illuminant:  D65 

>  Area  view:  Large 

>  Observer:  10° 

>  Mode:  Specular  included — Reflectance 

>  UV  filter:  Set  to  OUT 

>  UV  lamp:  Set  to  OFF 

>  Scale:  CIELAB  and  CMC  ratio  2:1. 

♦  The  samples  were  single-layer  samples  with  a  black  tile  as  the  backing. 

♦  The  samples  were  as  follows: 

>  Blue  3329  standard  and  sample  >  Black  472  standard  and  sample 

>  Blue  3372  standard  and  sample  >  CG  483  standard  and  sample 

>  Purple  3905  standard  and  sample  >  FG  504  standard  and  sample. 

♦  Each  “standard”  and  sample  was  measured  in  the  same  area,  with  the  same 
orientation,  to  collect  the  de  CMC,1  AL*,  Aa*,  and  Ab*  values. 

♦  The  de  CMC,  AL*,  Aa*,  and  Ab*  values  for  each  color  were  recorded. 


1  de  CMC  is  a  summary  of  the  L*,  a*,  and  b*  data. 
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Data  Collection  Protocol 


Data  Collection  Protocol  for  Camouflage 
Fabric  Samples 

We  used  AATCC  Evaluation  Procedure  6,  Instrumental  Color  Measurement, 
Section  2,  “Measurement  of  Color  by  Reflectance  Methods,”  as  a  guideline  for 
the  color  measurement  of  camouflage  fabric  samples  (six  Air  Force  ABU  samples 
and  six  Anny  ACU  samples). 

♦  The  samples  were  clean  and  free  of  lint,  and  were  allowed  to  condition 
if  necessary. 

♦  Using  the  green  tile,  the  spectrophotometer  was  properly  calibrated  to 
manufacturer’s  specifications  before  measurements  were  taken. 

♦  The  following  parameters  were  used  for  measuring  the  camouflage 
fabric  samples: 

>  Illuminant:  D65 

>  Area  view:  Small 

>  Observer:  10° 

>  Mode:  Specular  included — Reflectance 

>  UV  filter:  Set  to  OUT 

>  UV  lamp:  Set  to  OFF 

>  Scale:  CIEFAB  Absolute  values. 

♦  The  samples  were  single-layer  samples  with  a  black  tile  as  the  backing. 

♦  The  samples  were  identified  as  follows: 


> 

Anny  ACU  A 

> 

Air  Force  ABU  A 

> 

Anny  ACU  B 

> 

Air  Force  ABU  B 

> 

Anny  ACU  C 

> 

Air  Force  ABU  C 

> 

Anny  ACU  D 

> 

Air  Force  ABU  D 

> 

Anny  ACU  E 

> 

Air  Force  ABU  E 

> 

Anny  ACU  F 

> 

Air  Force  ABU  F. 

Each  sample  was  measured  in  the  same  area,  with  the  same  orientation,  to  collect 
the  F*,  a*,  and  b*  values. 
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Appendix  C 

Abbreviations 


AATCC 

ABU 

ACU 

ANOVA 

C&T 

CDUM 

CIE 

CIELAB 

DLA 

DSCC 

DSCP 

HQ 

IPT 

MC 

PTC 

RDEC 

SPSS 


American  Association  of  Textile  Chemists  and  Colorists 

airman  battle  unifonn 

Anny  combat  unifonn 

analysis  of  variance 

Clothing  and  Textiles 

Customer  Driven  Uniform  Manufacturing 

Commission  Internationale  d’ Eclair age 
(International  Commission  on  Illumination) 

CIE  L*a*b* 

Defense  Logistics  Agency 
Defense  Supply  Center  Columbus 
Defense  Supply  Center  Philadelphia 
Headquarters 
Integrated  Product  Team 
Marine  Corps 
Product  Testing  Center 

Research,  Development,  and  Engineering  Center 
Statistical  Package  for  the  Social  Sciences 
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